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Cellular Expression of P 2 AR-(3galAa Fusion Protein in C2 Clones 

(measured by anti-g-ga! ELISA) 
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FIGURE 1A 



Cellular expression of pArr2-pgalAco fusion protein in C2 clones 

(measured by anti-p gal ELISA) 
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FIGURE 1B 
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Agonist Stimulated cAMP Response in C2 Cells Expressing 32AR-pgalAa 
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FIGURE 2 



0-galactosidase Complementation as a Measurement for p2AR-(3galAa 
interacting with pArrestin2-(3galAco upon agonist Stimulation 
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FIGURE 3A 
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(3-galactosidase Complementation as a Measurement for p2AR-pgalAa 
Interaction with (3Arrestin1-pgalAco upon Agonist Stimulation 
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FIGURE 3B 
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(3-galactosidase Activity in Response to Agonist in C2 Cells 
Coexpressing (32AR-PgalAa and (3Arrestin2-(3galAco Fusion Prote 
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FIGURE 4A 



(3-galactosidase Activity in Response to Agonist in C2 Cells 
Coexpressing |32AR-[3galAa and pArrestinl-pgalAco Fusion Proteins 
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FIGURE 4B 



Inhibition of p-galactosidase activity in C2 Cells Coexpressing 
(32AR-(3galAa and f3Arrestin2-(3galAco Fusion Proteins 
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FIGURE 5A 




Antagonist Inhibition of (5-galactosidase Activity in C2 Cells 
Coexpressing (32AR-(3galAcc and (3Arrestin1-(3galAco Fusion Proteins 
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Figure 5B 



Agonist Stimulated cAMP Response in Clones or Pools of C2 Cells 
Coexpressing A2aR-PgalAa and (3Arrestin1-pgalAco Fusion Proteins 
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FIGURE 6 



Agonist Stimulated cAMP Response in Clones or Pools of C2 Cells 
Expressing D1-(3galAa and (3Arrestin2-PgalAco Fusion Proteins 
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FIGURE 7 



P 2 AR-3galAco and parr2-pgalAa Interaction in HEK293 
Clones in Response to Isoproterenol Treatment (1 |iM) 
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FIGURE 8A 



P2AR-pgalAct and PArrl-PgalA CO Interaction in a CHO Pool 

in Response to Isoproterenol Treatment(IOuM) 
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FIGURE 8B 



p2AR-pgalAa and pArr2-pgalAco Interaction in CHW Clone 
in Response to Isoproterenol Treatment (10uM) 
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FIGURE 8C 



p-galactosidase Complementation as a Measurement for 
Adrenergic Receptor Homodimerization in HEK 293 Cells 

Coexpressing (32AR-pgalAcc and (32AR-pgalAco. 
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Agonist Stimulated cAMP Response in HEK 293 Cells 
Coexpressing p2AR-|3galAa and (32AR-pgalA© 
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FIGURE 9B 
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Figure 10A 



I CTGCAGCCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 
GACGTCGGAC TTATACCCGG TTTGTCCTAT AGACACCATT CGTCAAGGAC 



51 


CCCCGGCTCA 
GGGGCCGAGT 


GGGCCAAGAA 
CCCGGTTCTT 


CAGATGGAAC 
GTCTACCTTG 


AGCTGAATAT 
TCGACTTATA 


GGGCCAAACA 
CCCGGTTTGT 


101 


GGATATCTGT 
CCTATAGACA 


GGTAAGCAGT 
CCATTCGTCA 


TCCTGCCCCG 
AGGACGGGGC 


GCTCAGGGCC 
CGAGTCCCGG 


AAGAACAGAT 
TTCTTGTCTA 


151 


GGTCCCCAGA 
CCAGGGGTCT 


TGCGGTCCAG 
ACGCCAGGTC 


CCCTCAGCAG 
GGGAGTCGTC 


TTTCTAGAGA 
AAAGATCTCT 


ACCATCAGAT 
TGGTAGTCTA 


201 


GTTTCCAGGG 
CAAAGGTCCC 


TGCCCCAAGG 
ACGGGGTTCC 


ACCTGAAATG 
TGGACTTTAC 


ACCCTGTGCC 
TGGGACACGG 


TTATTTGAAC 
AATAAACTTG 


251 


TAACCAATCA 
ATTGGTTAGT 


GTTCGCTTCT 
CAAGCGAAGA 


CGCTTCTGTT 
GCGAAGACAA 


CGCGCGCTTC 
GCGCGCGAAG 


TGCTCCCCGA 
ACGAGGGGCT 


1,1 ; 


GCTCAATAAA 
CGAGTTATTT 


AGAGCCCACA 
TCTCGGGTGT 


ACCCCTCACT 
TGGGGAGTGA 


CGGGGCGCCA 
GCCCCGCGGT 


GTCCTCCGAT 
CAGGAGGCTA 


i 3 : 

531 

- ! 3 


TGACTGAGTC 
ACTGACTCAG 


GCCCGGGTAC 
CGGGCCCATG 


CCGTGTATCC 
GGCACATAGG 


AATAAACCCT 
TTATTTGGGA 


CTTGCAGTTG 
GAACGTCAAC 


Wbi 


CATCCGACTT 
GTAGGCTGAA 


GTGGTCTCGC 
CACCAGAGCG 


TGTTCCTTGG 
ACAAGGAACC 


GAGGGTCTCC 
CTCCCAGAGG 


TCTGAGTGAT 
AGACTCACTA 




TGACTACCCG 
ACTGATGGGC 


TCAGCGGGGG 
AGTCGCCCCC 


TCTTTCATTT 
AGAAAGTAAA 


GGGGGCTCGT 
CCCCCGAGCA 


CCGGGATCGG 
GGCCCTAGCC 


: 


GAGACCCCTG 
CTCTGGGGAC 


CCCAGGGACC 
GGGTCCCTGG 


ACCGACCCAC 
TGGCTGGGTG 


CACCGGGAGG 
GTGGCCCTCC 


CAAGCTGGCC 
GTTCGACCGG 


5%i 


AGCAACTTAT 
TCGTTGAATA 


CTGTGTCTGT 
GACACAGACA 


CCGATTGTCT 
GGCTAACAGA 


AGTGTCTATG 
TCACAGATAC 


ACTGATTTTA 
TGACTAAAAT 


601 


TGCGCCTGCG 
ACGCGGACGC 


TCGGTACTAG 
AGCCATGATC 


TTAGCTAACT 
AATCGATTGA 


AGCTCTGTAT 
TCGAGACATA 


CTGGCGGACC 
GACCGCCTGG 


651 


CGTGGTGGAA 
GCACCACCTT 


CTGACGAGTT 
GACTGCTCAA 


CTGAACACCC 
GACTTGTGGG 


GGCCGCAACC 
CCGGCGTTGG 


CTGGGAGACG 
GACCCTCTGC 


701 


TCCCAGGGAC 
AGGGTCCCTG 


TTTGGGGGCC 
AAACCCCCGG 


GTTTTTGTGG 
CAAAAACACC 


CCCGACCTGA 
GGGCTGGACT 


GGAAGGGAGT 
CCTTCCCTCA 


751 


CGATGTGGAA 
GCTACACCTT 


TCCGACCCCG 
AGGCTGGGGC 


TCAGGATATG 
AGTCCTATAC 


TGGTTCTGGT 
ACCAAGACCA 


AGGAGACGAG 
TCCTCTGCTC 


801 


AACCTAAAAC 
TTGGATTTTG 


AGTTCCCGCC 
TCAAGGGCGG 


TCCGTCTGAA 
AGGCAGACTT 


TTTTTGCTTT 
AAAAACGAAA 


CGGTTTGGAA 
GCCAAACCTT 


851 


CCGAAGCCGC 
GGCTTCGGCG 


GCGTCTTGTC 
CGCAGAACAG 


TGCTGCAGCA 
ACGACGTCGT 


TCGTTCTGTG 
AGCAAGACAC 


TTGTCTCTGT 
AACAGAGACA 


901 


CTGACTGTGT 
GACTGACACA 


TTCTGTATTT 
AAGACATAAA 


GTCTGAAAAT 
CAGACTTTTA 


TAGGGCCAGA 
ATCCCGGTCT 


CTGTTACCAC 
GACAATGGTG 



FIGURE 10B 
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TOCCTTAAGT 
AGGGAATTCA 
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AACTGGAATC 
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cattgacctt 


AGATGTCGAG 
TCTACAGCTC 
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ACAACCAGTC 
TGTTGGTCAG 


GGTAGATGTC 
CCATCTACAG 
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TTCTTCTCTG 


GTTGGGTTAC 
CAACCCAATG 
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GAAGACGAGA 
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GCAGAATGGC 
CGTCTTACCG 


CAACC.TTTAA 
GTTGGAAATT 


CGTCGGATGG 
GCAGCCTACC 


CCGCGAGACG 
GGCGCTCTGC 


GCACCTTTAA 
CGTGGAAATT 
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CCGAGACCTC 
GGCTCTGGAG 


ATCACCCAGG 
TAGTGGGTCC 


TTAAGATCAA 
AATTCTAGTT 


GGTCTTTTCA 
CCAGAAAAGT 


CCTGGCCCGC 
GGACCGGGCG 


1151 


ATGGACACCC 
TACCTGTGGG 


AGACCAGGTC 
TGTGGTCCAG 


CCCTACATCG 
GGGATGTAGC 


TGACCTGGGA 
ACTGGACCCT 


AGCCTTGGCT 
TCGGAACCGA 
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TTTGACCCCC 
AAACTGGGGG 


CTCCCTGGGT 
GAGGGACCCA 


CAAGCCCTTT 
GTTCGGGAAA 


GTACACCCTA 
CATGTGGGAT 


AGCCTCCGCC 
TCGGAGGCGG 
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TCCTCTTCCT 
AGGAGAAGGA 
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CGTCTCTCCC 
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CCTTGAACCT 
GGAACTTGGA 


CCTCGTTCGA 
GGAGCAAGCT 
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GGCCGCTCTA 
CCGGCGAGAT 


GCCCATTAAT 
CGGGTAATTA 


ACGACTCACT 
TGCTGAGTGA 


ATAGGGCGAT 
TATCCCGCTA 


TCGAATCAGG 
AGCTTAGTCC 
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CCTTGGCGCG 
GGAACCGCGC 


CCGGATCCTT 
GGCCTAGGAA 


AATTAAGCGC 
TTAATTCGCG 


AATTGGGAGG 
TTAACCCTCC 


TGGCGGTAGC 
ACCGCCATCG 
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DSL 
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CTCGAGATGG 
GAGCTCTACC 


GCGTGATTAC 
CGCACTAATG 


GGATTCACTG 
CCTAAGTGAC 


GCCGTCGTGG 
CGGCAGCACC 


CCCGCACCGA 
GGGCGTGGCT 
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R P S 


Q Q L R S L N 


G E W 


R F A 


1501 


TCGCCCTTCC 
AGCGGGAAGG 


CAACAGTTAC 
GTTGTCAATG 


GCAGCCTGAA 
CGTCGGACTT 


TGGCGAATGG 
ACCGCTTACC 


CGCTTTGCCT 
GCGAAACGGA 


+ 2 


W F P A 


PEA 


V P E S 


; W L E 


C D L 


1551 


GGTTTCCGGC 
CCAAAGGCCG 


ACCAGAAGCG 
TGGTCTTCGC 


GTGCCGGAAA 
CACGGCCTTT 


GCTGGCTGGA 
CGACCGACCT 


GTGCGATCTT 
CACGCTAGAA 


+ 2 


PEA! 


D T V V 


VPS 


N W Q M H G Y 


1601 


CCTGAGGCCG 
GGACTCCGGC 


ATACTGTCGT 
TATGACAGCA 


CGTCCCCTCA 
GCAGGGGAGT 


AACTGGCAGA 
TTGACCGTCT 


TGCACGGTTA 
ACGTGCCAAT 
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DAP 


I Y T N V T Y 


PIT 


V N P 



1651 CGATGCGCCC ATCTACACCA ACGTGACCTA TCCCATTACG GTCAATCCGC 
GCTACGCGGG TAGATGTGGT TGCACTGGAT AGGGTAATGC CAGTTAGGCG 



+2 P F V P TEN PTGC YSL T F N 



1701 CGTTTGTTCC CACGGAGAAT CCGACGGGTT GTTACTCGCT CACATTTAAT 
GCAAACAAGG GTGCCTCTTA GGCTGCCCAA CAATGAGCGA GTGTAAATTA 



+ 2VDES WLQ EGQ TRII FDG 



1751 GTTGATGAAA GCTGGCTACA GGAAGGCCAG ACGCGAATTA TTTTTGATGG 
CAACTACTTT CGACCGATGT CCTTCCGGTC TGCGCTTAAT AAAAACTACC 



+ 2 VNS A F H L WCN GRW VGY 



1801 CGTTAACTCG GCGTTTCATC TGTGGTGCAA CGGGCGCTGG GTCGGTTACG 
GCAATTGAGC CGCAAAGTAG ACACCACGTT GCCCGCGACC CAGCCAATGC 



+ 2GQDS RLP S E F D LSA FLR 



1851 GCCAGGACAG TCGTTTGCCG TCTGAATTTG ACCTGAGCGC ATTTTTACGC 
^ CGGTCCTGTC AGCAAACGGC AGACTTAAAC TGGACTCGCG TAAAAATGCG 



"™+2 AGEN RLA VMV LRWS DGS 
' =33 4 

lUl GCCGGAGAAA ACCGCCTCGC GGTGATGGTG CTGCGCTGGA GTGACGGCAG 
J3 CGGCCTCTTT TGGCGGAGCG CCACTACCAC GACGCGACCT CACTGCCGTC 

SH+2 YLE DQDM WRM S GI FRD 

1^51 TTATCTGGAA GATCAGGATA TGTGGCGGAT GAGCGGCATT TTCCGTGACG 
JL AATAGACCTT CTAGTCCTAT ACACCGCCTA CTCGCCGTAA AAGGCACTGC 

i 51 

^+2 V SLL HKP TTQI SDF H V A 

2 (pi TCTCGTTGCT GCATAAACCG ACTACACAAA TCAGCGATTT CCATGTTGCC 
S~ AGAGCAACGA CGTATTTGGC TGATGTGTTT AGTCGCTAAA GGTACAACGG 

^ « ■ III 

'+2TRFN DDF SRA VLEA EVQ 



2051 ACTCGCTTTA ATGATGATTT CAGCCGCGCT GTACTGGAGG CTGAAGTTCA 
TGAGCGAAAT TAG TACT AAA GTCGGCGCGA CATGACCTCC GACTTCAAGT 



+ 2 MCG ELRD YLR VTV SLW 



2101 GATGTGCGGC GAGTTGCGTG ACTACCTACG GGTAACAGTT TCTTTATGGC 
CTACACGCCG CTCAACGCAC TGATGGATGC CCATTGTCAA AGAAAT AC CG 



+ 2QGET Q V A SGTA PFG GEI 



2151 AGGGTGAAAC GCAGGTCGCC AGCGGCACCG CGCCTTTCGG CGGTGAAATT 
TCCCACTTTG CGTCCAGCGG TCGCCGTGGC GCGGAAAGCC GCCACTTTAA 



+ 2IDER GGY ADR VTLR LNV 



2201 ATCGATGAGC GTGGTGGTTA TGCCGATCGC GTCACACTAC GTCTGAACGT 
TAGCTACTCG CACCACCAAT ACGGCTAGCG CAGTGTGATG CAGACTTGCA 



+ 2 ENP KLWS A E I PNL YRA 



22 51 CGAAAACCCG AAACTGTGGA GCGCCGAAAT CCCGAATCTC TATCGTGCGG 
GCTTTTGGGC TTTGACACCT CGCGGCTTTA GGGCTTAGAG ATAGCACGCC 
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V V £ L 


H T A 


D G T 


L I E A 


E A C 


2301 


TGGTTGAACT 
ACCAACTTGA 


GCACACCGCC 
CGTGTGGCGG 


GACGGCACGC 
CTGCCGTGCG 


TGATTGAAGC 
ACTAACTTCG 


AGAAGCCTGC 
TCTTCGGACG 
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D V G 1 


F REV 


R I E 


N G L ] 


L L L N 


2351 


GATGTCGGTT 
CTACAGCCAA 


TCCGCGAGGT 
AGGCGCTCCA 


GCGGATTGAA 
CGCCTAACTT 


AATGGTCTGC 
TTACCAGACG 


TGCTGCTGAA 
ACGACGACTT 


+ 2 


G K P 


L L I R G V N 


R H E 


HHP 




GCCGTTCGGC 


TTGCTGATTC 
AACGACTAAG. 


GAGGCGTTAA 
CTCCGCAATT 


CCGTCACGAG 
GGCAGTGCTC 


CATCATCCTC 
GTAGTAGGAG 



+ 2LHGQ VMD EQTM V Q - D ILL 

2 4 51 TGCATGGTCA GGTCATGGAT GAGCAGACGA TGGTGCAGGA TATCCTGCTG 
ACGTACCAGT CCAGTACCTA CTCGTCTGCT ACCACGTCCT ATAGGACGAC 
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2gfl ATGAAGCAGA ACAACTTTAA CGCCGTGCGC TGTTCGCATT ATCCGAACCA 
TACTTCGTCT TGTTGAAATT GCGGCACGCG ACAAGCGTAA TAGGCTTGGT 



PLW YTLC D 



V V 



2mi TCCGCTGTGG TACACGCTGT GCGACCGCTA CGGCCTGTAT GTGGTGGATG 
5 AGGCGACACC ATGTGCGACA CGCTGGCGAT GCCGGACATA CACCACCTAC 
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2§R1 AAGCCAATAT TGAAACCCAC GGCATGGTGC CAATGAATCG TCTGACCGAT 
z: TTCGGTTATA ACTTTGGGTG CCGTACCACG GTTACTTAGC AGACTGGCTA 
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V T R M V 



2 651 GATCCGCGCT GGCTACCGGC GATGAGCGAA CGCGTAACGC GAATGGTGCA 
CTAGGCGCGA CCGATGGCCG CTACTCGCTT GCGCATTGCG CTTACCACGT 



+ 2 RDR NHPS VII W 



N 



27 01 GCGCGATCGT AATCACCCGA GTGTGATCAT CTGGTCGCTG GGGAATGAAT 
CGCGCTAGCA TTAGTGGGCT CACACTAGTA GACCAGCGAC CCCTTACTTA 



+ 2SGHG ANH DALY RWI KSV 

27 51 CAGGCCACGG CGCTAATCAC GACGCGCTGT ATCGCTGGAT CAAATCTGTC 
GTCCGGTGCC GCGATTAGTG CTGCGCGACA TAGCGACCTA GTTTAGACAG 



+ 2 D P S R P V 



D 



2801 GATCCTTCCC GCCCGGTGCA GTATGAAGGC GGCGGAGCCG ACACCACGGC 
CTAGGAAGGG CGGGCCACGT CATACTTCCG CCGCCTCGGC TGTGGTGCCG 



+ 2 T D I ICP M YAR VDE DQP 

2851 CACCGATATT ATTTGCCCGA TGTACGCGCG CGTGGATGAA GACCAGCCCT 
GTGGCTATAA TAAACGGGCT ACATGCGCGC GCACCTACTT CTGGTCGGGA 
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TCCCGGCTGT 
AGGGCCGACA 


GCCGAAATGG 
CGGCTTTACC 


TCCATCAAAA 
AGGTAGTTTT 


AATGGCTTTC 
TTACCGAAAG 


GCTACCTGGA 
CGATGGACCT 
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2951 


GAGACGCGCC 
CTCTGCGCGG 


CGCTGATCCT 
GCGACTAGGA 


TTGCGAATAC 
AACGCTTATG 


GCCCACGCGA 
CGGGTGCGCT 


TGGGTAACAG 
ACCCATTGTC 
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L G G 


F A K Y W Q A 


F R Q 


Y P R 
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TCTTGGCGGT 
AGAACCGCCA 


TTCGCTAAAT 
AAGCGATTTA 


ACTGGCAGGC 
TGACCGTCCG 


GTTTCGTCAG 
CAAAGCAGTC 


TATCCCCGTT 
ATAGGGGCAA- 
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D W V D Q S L 
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TACAGGGCGG 
ATGTCCCGCC 


CTTCGTCTGG 
GAAGCAGACC 


GACTGGGTGG 
CTGACCCACC 


ATCAGTCGCT 
TAGTCAGCGA 


GATTAAATAT 
CTAATTTATA 
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DENG N P W 


SAY 


G G D F G D T 


3 i£fl 


GATGAAAACG 
CTACTTTTGC 


GCAACCCGTG 
CGTTGGGCAC 


GTCGGCTTAC 
CAGCCGAATG 


GGCGGTGATT 
CCGCCACTAA 


TTGGCGATAC 
AACCGCTATG 


■* 
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R Q F C 
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L V F 


ADR 




GCCGAACGAT 
GGGCTTGCTA 


CGCCAGTTCT 
GCGGTCAAGA 


GTATGAACGG 
CATACTTGCC 


TCTGGTCTTT 
AGACCAGAAA 


GCCGACCGCA 
CGGCTGGCGT 
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3251 TTCCGTTTAT CCGGGCAAAC CATCGAAGTG ACCAGCGAAT ACCTGTTCCG 
AAGGCAAATA GGCCCGTTTG GTAGCTTCAC TGGTCGCTTA TGGACAAGGC 



+ 2 HSD NELL HWM VAL DGK 



3301 TCATAGCGAT AACGAGCTCC TGCACTGGAT GGTGGCGCTG GATGGTAAGC 
AGTATCGCTA TTGCTCGAGG ACGTGACCTA CCACCGCGAC CTACCATTCG 



+ 2PLAS GEV PLDV APQ GKQ 



3351 CGCTGGCAAG CGGTGAAGTG CCTCTGGATG TCGCTCCACA AGGTAAACAG 
GCGACCGTTC GCCACTTCAC GG AG ACCT AC AGCGAGGTGT TCCATTTGTC 



+ 2LIEL PEL PQP ESAG QLW 



34 01 TTGATTGAAC TGCCTGAACT ACCGCAGCCG GAGAGCGCCG GGCAACTCTG 
AACTAACTTG ACGGACTTGA TGGCGTCGGC CTCTCGCGGC CCGTTGAGAC 



+ 2 LTV RVVQ PNA T AW SEA 



3451 GCTCACAGTA CGCGTAGTGC AACCGAACGC GACCGCATGG TCAGAAGCCG 
CGAGTGTCAT GCGCATCACG TTGGCTTGCG CTGGCGTACC AGTCTTCGGC 



+ 2GHIS AW-Q QWRL A E N L 3 V 



3501 GGCACATCAG CGCCTGGCAG CAGTGGCGTC TGGCGGAAAA CCTCAGTGTG 
CCGTGTAGTC GCGGACCGTC GTCACCGCAG ACCGCCTTTT GGAGTCACAC 



+ 2TLPA ASH AIP HLTT SEM 



3551 ACGCTCCCCG CCGCGTCCCA CGCCATCCCG CATCTGACCA CCAGCGAAAT 
TGCGAGGGGC GGCGCAGGGT GCGGTAGGGC GTAGACTGGT GGTCGCTTTA 



+ 2 DFC IELG NKR WQF NRQ 



3 601 GGATTTTTGC ATCGAGCTGG GTAATAAGCG TTGGCAATTT AACCGCCAGT 
CCTAAAAACG TAGCTCGACC CATTATTCGC AACCGTTAAA TTGGCGGTCA 



+ 2SGFL SQM WIGD KKQ LLT 



3 651 CAGGCTTTCT TTCACAGATG TGGATTGGCG ATAAAAAACA ACTGCTGACG 

^ GTCCGAAAGA AAGTGTCTAC ACCTAACCGC TATTTTTTGT TGACGACTGC 

Ci + 2 P L R D QFT RAP LDND I G V 

' "* ^™ mm ^» ^» ^» ^» ^» ^» ^» ^H- A 

3^01 CCGCTGCGCG ATCAGTTCAC CCGTGCACCG CTGGATAACG ACATTGGCGT 

yi GGCGACGCGC TAGTCAAGTG GGCACGTGGC GACCTATTGC TGTAACCGCA 



Ul+2 SEA TRID PNA WVE RWK 

37?1 AAGTGAAGCG ACCCGCATTG ACCCTAACGC CTGGGTCGAA CGCTGGAAGG 
!_ TTCACTTCGC TGGGCGTAAC TGGGATTGCG GACCCAGCTT GCGACCTTCC 



f™+2 AAGH YQA E A A L LQC TAD 



38]R1 CGGCGGGCCA TTACCAGGCC GAAGCAGCGT TGTTGCAGTG CACGGCAGAT 
F~ GCCGCCCGGT AATGGTCCGG CTTCGTCGCA ACAACGTCAC GTGCCGTCTA 



+ 2T LAD AVL ITT AHAW QHQ 



38 51 ACACTTGCTG ATGCGGTGCT GATTACGACC GCTCACGCGT GGCAGCATCA 
TGTGAACGAC TACGCCACGA CTAATGCTGG CGAGTGCGCA CCGTCGTAGT 



+ 2 GKT LFIS RKT YRI DGS 



3901 GGGGAAAACC TTATTTATCA GCCGGAAAAC CTACCGGATT GATGGTAGTG 
CCCCTTTTGG AATAAATAGT CGGCCTTTTG GATGGCCTAA CTACCATCAC 



+ 2GQMA ITV DVEV ASD TPH 



3951 GTCAAATGGC GATTACCGTT GATGTTGAAG TGGCGAGCGA TACACCGCAT 
CAGTTTACCG CTAATGGCAA CTACAACTTC ACCGCTCGCT ATGTGGCGTA 



+ 2 PARI GLN CQL A Q V A ERV 



4 001 CCGGCGCGGA TTGGCCTGAA CTGCCAGCTG GCGCAGGTAG CAGAGCGGGT 
GGCCGCGCCT AACCGGACTT GACGGTCGAC CGCGTCCATC GTCTCGCCCA 



+ 2 NWL GLGP QEN Y PD RLT 



4 051 AAACTGGCTC GGATTAGGGC CGCAAGAAAA CTATCCCGAC CGCCTTACTG 
TTTGACCGAG CCTAATCCCG GCGTTCTTTT GATAGGGCTG GCGGAATGAC 



+ 2 A A C F DRW DLPL SDM Y T P 



4101 


CCGCCTGTTT 
GGCGGACAAA 


TGACCGCTGG 
ACTGGCGACC 


GATCTGCCAT 
CTAGACGGTA 


TGTCAGACAT 
ACAGTCTGTA 


GTATACCCCG 
CATATGGGGC 


+ 2 


Y V F 1 


P SEN 


G L R 


C G T R E L N 


4151 


TACGTCTTCC 
ATGCAGAAGG 


CGAGCGAAAA 
GCTCGCTTTT 


CGGTCTGCGC 
GCCAGACGCG 


TGCGGGACGC 
ACGCCCTGCG 


GCGAATTGAA 
CGCTTAACTT 


+ 2 


Y G P 


H Q W R G D F 


Q F N 


I S R 


4201 


TTATGGCCCA 
AATACCGGGT 


CACCAGTGGC 
GTGGTCACCG 


GCGGCGACTT 
CGCCGCTGAA 


CCAGTTCAAC 
GGTCAAGTTG 


ATCAGCCGCT 
TAGTCGGCGA 


+ 2 


Y S Q Q 


Q L M 


E T S H R H L 


L H A 


4251 


ACAGTCAACA 
TGTCAGTTGT 


GCAACTGATG 
CGTTGACTAC 


GAAACCAGCC 
CTTTGGTCGG 


ATCGCCATCT 
TAGCGGTAGA 


GCTGCACGCG 
CGACGTGCGC 




E E G T W L N 


IDG 


F H M G I G G 


4ffl)l 


GAAGAAGGCA 
CTTCTTCCGT 


CA'TGGCTGAA 
GTACCGACTT 


TATCGACGGT 
ATAGCTGCCA 


TTCCATATGG 
AAGGTATACC 


GGATTGGTGG 
CCTAACCACC 


J H +2 


D D S 


W S P S V S A 


E F Q 


L S A 


4-Jii 

B 

! 3 


CGACGACTCC 
GCTGCTGAGG 


TGGAGCCCGT 
ACCTCGGGCA 


CAGTATCGGC 
GTCATAGCCG 


GGAATTGCAG 
CCTTAAGGTC 


CTGAGCGCCG 
GACTCGCGGC 


i 2 

j ^ 


G R Y H 


Y Q L 


V W C Q K R S 


D Y K 


4Rr©l 


GTCGCTACCA 
CAGCGATGGT 


TTACCAGTTG 
AATGGTCAAC 


GTCTGGTGTC 
CAGACCACAG 


AAAAAAGATC 
TTTTTTCTAG 


TG AC TAT AAA 
ACTGATATTT 


+ 2 


D E D L D H H 


H H H 


H R 




/I /I CI 

fl fi O ± 


GATGAGGACC 
CTACTCCTGG 


TCGACCATCA 
AGCTGGTAGT 


TCATCATCAT 
AGTAGTAGTA 


CACCGGTAAT 
GTGGCCATTA 


AATAGGTAGA 
TTATCCATCT 




TAAGTGACTG 
ATTCACTGAC 


ATTAGATGCA 
TAATCTACGT 


TTGATCCCTC 
AACTAGGGAG 


GACCAATTCC 
CTGGTTAAGG 


GGTTATTTTC 
CCAATAAAAG 




CACCATATTG 
GTGGTATAAC 


CCGTCTTTTG 
GGCAGAAAAC 


GCAATGTGAG 
CGTTACACTC 


GGCCCGGAAA 
CCGGGCCTTT 


CCTGGCCCTG 
GGACCGGGAC 


4 o U 1 


TCTTCTTGAC 
AGAAGAACTG 


GAGCATTCCT 
CTCGTAAGGA 


AGGGGTCTTT 
TCCCCAGAAA 


CCCCTCTCGC 
GGGGAGAGCG 


CAAAGGAATG 
GTTTCCTTAC 


fl DO 1 


CAAGGTCTGT 
GTTCCAGACA 


TGAATGTCGT 
ACTTACAGCA 


GAAGGAAGCA 
CTTCCTTCGT 


GTTCCTCTGG 
CAAGGAGACC 


AAGCTTCTTG 
TTCGAAGAAC 


4701 


AAGACAAACA 
TTCTGTTTGT 


ACGTCTGTAG 
TGCAGACATC 


CGACCCTTTG 
GCTGGGAAAC 


CAGGCAGCGG 
GTCCGTCGCC 


AACCCCCCAC 
TTGGGGGGTG 


4751 


CTGGCGACAG 
GACCGCTGTC 


GTGCCTCTGC 
CACGGAGACG 


GGCCAAAAGC 
CCGGTTTTCG 


CACGTGTATA 
GTGCACATAT 


AGATACACCT 
TCTATGTGGA 



4 801 GCAAAGGCGG CACAACCCCA GTGCCACGTT GTGAGTTGGA TAGTTGTGGA 
CGTTTCCGCC GTGTTGGGGT CACGGTGCAA CACTCAACCT ATCAACACCT 



4 851 AAGAGTCAAA TGGCTCTCCT CAAGCGTATT CAACAAGGGG CTGAAGGATG 
TTCTCAGTTT ACCGAGAGGA GTTCGCATAA GTTGTTCCCC GACTTCCTAC 



4 901 CCCAGAAGGT ACCCCATTGT ATGGGATCTG ATCTGGGGCC TCGGTGCACA 
GGGTCTTCCA TGGGGTAACA TACCCTAGAC TAGACCCCGG AGCCACGTGT 



4 951 TGCTTTACAT GTGTTTAGTC GAGGTTAAAA AACGTCTAGG CCCCCCGAAC 
ACGAAATGTA CACAAATCAG CTCCAATTTT TTGCAGATCC GGGGGGCTTG 



5001 CACGGGGACG TGGTTTTCCT TTGAAAAACA CGATGATAAT ACCATGATTG 
GTGCCCCTGC ACCAAAAGGA AACTTTTTGT GCTACTATTA TGGTACTAAC 



5051 


AACAAGATGG 
TTGTTCTACC 


ATTGCACGCA 
TAACGTGCGT 


GGTTCTCCGG 
CCAAGAGGCC 


CCGCTTGGGT 
GGCGAACCCA 


GGAGAGGCTA 
CCTCTCCGAT 


^ 1 


TTCGGCTATG 
AAGCCGATAC 


ACTGGGCACA 
TGACCCGTGT 


AC AG AC AAT C 
TGTCTGTTAG 


GGCTGCTCTG 
CCGACGAGAC 


ATGCCGCCGT 
TACGGCGGCA 


sifi 


GTTCCGGCTG 
CAAGGCCGAC 


TCAGCGCAGG 
AGTCGCGTCC 


GGCGCCCGGT 
CCGCGGGCCA 


TCTTTTTGTC 
AGAAAAACAG 


AAGACCGACC 
TTCTGGCTGG 


5lfll 


TGTCCGGTGC 
ACAGGCCACG 


CCTGAATGAA 
GGACTTACTT 


CTGCAGGACG 
GACGTCCTGC 


AGGCAGCGCG 
TCCGTCGCGC 


GCTATCGTGG 
CGATAGCACC 


52^1 


CTGGCCACGA 
GACCGGTGCT 


CGGGCGTTCC 
GCCCGCAAGG 


TTGCGCAGCT 
AACGCGTCGA 


GTGCTCGACG 
CACGAGCTGC 


TTGTCACTGA 
AACAGTGACT 


5 ft 1 

D 


AGCGGGAAGG 
TCGCCCTTCC 


GACTGGCTGC 
CTGACCGACG 


TATTGGGCGA 
ATAACCCGCT 


AGTGCCGGGG 
TCACGGCCCC 


CAGGATCTCC 
GTCCTAGAGG 



5351 TGTCATCTCA CCTTGCTCCT GCCGAGAAAG TATCCATCAT GGCTGATGCA 
ACAGTAGAGT GGAACGAGGA CGGCTCTTTC ATAGGTAGTA CCGACTACGT 



54 01 ATGCGGCGGC TGCATACGCT TGATCCGGCT ACCTGCCCAT TCGACCACCA 
TACGCCGCCG ACGTATGCGA ACTAGGCCGA TGGACGGGTA AGCTGGTGGT 



5451 AGCGAAACAT CGCATCGAGC GAGCACGTAC TCGGATGGAA GCCGGTCTTG 
TCGCTTTGTA GCGTAGCTCG CTCGTGCATG AGCCTACCTT CGGCCAGAAC 



5501 TCGATCAGGA TGATCTGGAC GAAGAGCATC AGGGGCTCGC GCCAGCCGAA 
AGCTAGTCCT ACTAGACCTG CTTCTCGTAG TCCCCGAGCG CGGTCGGCTT 



5551 CTGTTCGCCA GGCTCAAGGC GCGCATGCCC GACGGCGAGG ATCTCGTCGT 
GACAAGCGGT CCGAGTTCCG CGCGTACGGG CTGCCGCTCC TAGAGCAGCA 



5 601 GACCCATGGC GATGCCTGCT TGCCGAATAT CATGGTGGAA AATGGCCGCT 
CTGGGTACCG CTACGGACGA ACGGCTTATA GTACCACCTT TTACCGGCGA 



651 


TTTCTGGATT 
AAAGACCTAA 


CATCGACTGT 
GTAGCTGACA 


GGCCGGCTGG 
CCGGCCGACC . 


GTGTGGCGGA 
CACACCGCCT 


CCGCTATCAG 
GGCGATAGTC 


701 


GACATAGCGT 
CTGTATCGCA 


TGGCTACCCG 
ACCGATGGGC 


TGATATTGCT 
ACTATAACGA 


GAAGAGCTTG 
CTTCTCGAAC 


GCGGCGAATG 
CGCCGCTTAC 



57 51 GGCTGACCGC TTCCTCGTGC TTTACGGTAT CGCCGCTCCC GATTCGCAGC 
CCGACTGGCG AAGGAGCACG AAATGCCATA GCGGCGAGGG CTAAGCGTCG 



5801 


GCATCGCCTT 
CGTAGCGGAA 


CTATCGCCTT 
GATAGCGGAA 


CTTGACGAGT 
GAACTGCTCA 


TCTTCTGAGC 
AGAAGACTCG 


GGGACTCTGG 
CCCTGAGACC 


5851 


GGTTCGCATC 
CCAAGCGTAG 


GATAAAATAA 
CTATTTTATT 


AAGATTTTAT 
TTCTAAAATA 


TTAGTCTCCA 
AATCAGAGGT 


GAAAAAGGGG 
CTTTTTCCCC 


5901 


GGAATGAAAG 
CCTTACTTTC 


ACCCCACCTG 
TGGGGTGGAC 


TAGGTTTGGC 
ATCCAAACCG 


AAGCTAGCTT 
TTCGATCGAA 


AAGTAACGCC 
TTCATTGCGG 


5951 


ATTTTGCAAG 
TAAAACGTTC 


GCATGGAAAA 
CGTACCTTTT 


ATACATAACT 
TATGTATTGA 


GAGAATAGAG 
CTCTTATCTC 


AAGTTCAGAT 
TTCAAGTCTA 


6001 


CAAGGTCAGG 
GTTCCAGTCC 


AACAGATGGA 
TTGTCTACCT 


ACAGCTGAAT 
TGTCGACTTA 


ATGGGCCAAA 
TACCCGGTTT 


CAGGATATCT 
GTCCTATAGA 


63% 1 

it.; r 
tl. a 


GTGGTAAGCA 
CACCATTCGT 


GTTCCTGCCC 
CAAGGACGGG 


CGGCTCAGGG 
GCCGAGTCCC 


CCAAGAACAG 
GGTTCTTGTC 


ATGGAACAGC 
TACCTTGTCG 




TGAATATGGG 
ACTTATACCC 


CCAAACAGGA 
GGTTTGTCCT 


TATCTGTGGT 
ATAGACACCA 


AAGCAGTTCC 
TTCGTCAAGG 


TGCCCCGGCT 
ACGGGGCCGA 


rsi ; 
s 5 : 
: ^-s 


CAGGGCCAAG 
GTCCCGGTTC 


AACAGATGGT 
TTGTCTACCA 


CCCCAGATGC 
GGGGTCTACG 


GGTCCAGCCC 
CCAGGTCGGG 


TCAGCAGTTT 
AGTCGTCAAA 




CTAGAGAACC 
GATCTCTTGG 


ATCAGATGTT 
TAGTCTACAA 


TCCAGGGTGC 
AGGTCCCACG 


CCCAAGGACC 
GGGTTCCTGG 


TGAAATGACC 
ACTTTACTGG 


62,51 

W ^ 
1 

£ ** 


CTGTGCCTTA 
GACACGGAAT 


TTTGAACTAA 
AAACTTGATT 


CCAATCAGTT 
GGTTAGTCAA 


CGCTTCTCGC 
GCGAAGAGCG 


TTCTGTTCGC 
AAGACAAGCG 


ebtii 


GCGCTTCTGC 
CGCGAAGACG 


TCCCCGAGCT 
AGGGGCTCGA 


CAATAAAAGA 
GTTATTTTCT 


GCCCACAACC 
CGGGTGTTGG 


CCTCACTCGG 
GGAGTGAGCC 


6351 


GGCGCCAGTC 
CCGCGGTCAG 


CTCCGATTGA 
GAGGCTAACT 


CTGAGTCGCC 
GACTCAGCGG 


CGGGTACCCG 
GCCCATGGGC 


TGTATCCAAT 
ACATAGGTTA 


6401 


AAACCCTCTT 
TTTGGGAGAA 


GCAGTTGCAT 
CGTCAACGTA 


CCGACTTGTG 
GGCTGAACAC 


GTCTCGCTGT 
CAGAGCGACA 


TCCTTGGGAG 
AGGAACCCTC 


6451 


GGTCTCCTCT 
CCAGAGGAGA 


GAGTGATTGA 
CTCACTAACT 


CTACCCGTCA 
GATGGGCAGT 


GCGGGGGTCT 
CGCCCCCAGA 


TTCATTCATG 
AAGTAAGTAC 


6501 


CAGCATGTAT 
GTCGTACATA 


CAAAATTAAT 
GTTTTAATTA 


TTGGTTTTTT 
AACCAAAAAA 


TTCTTAAGTA 
AAGAATTCAT 


TTTACATTAA 
AAATGTAATT 


6551 


ATGGCCATAG 
TACCGGTATC 


TTGCATTAAT 
AACGTAATTA 


GAATCGGCCA 
CTTAGCCGGT 


ACGCGCGGGG 
TGCGCGCCCC 


AGAGGCGGTT 
TCTCCGCCAA 


6601 

\J SJ w -A. 


i vj\_,o 1 t\ j_ loo 

ACGCATAACC 


GCGAGAAGGC 


GAAGGAGCGA 


V^AL. 1 UAL. itb 

GTGACTGAGC 


GACGCGAGCC 


6651 


TCGTTCGGCT 
AGCAAGCCGA 


GCGGCGAGCG 
CGCCGCTCGC 


GTATCAGCTC 
CATAGTCGAG 


ACTCAAAGGC 
TGAGTTTCCG 


GGTAATACGG 
CCATTATGCC 



5'MoMuLV LTR 



ColElori 



SB MoMuLV LTR 



1 F5 
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Figure 11A 



1 CTGCAGCCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTT^^TG 
GACGTCGGAC TTATACCCGG TTTGTCCTAT AGACACCATT CGTCAAGGAC 



51 CCCCGGCTCA GGGCCAAGAA CAGATGGAAC AGCTGAATAT GGGCCAAACA 
GGGGCCGAGT CCCGGTTCTT GTCTACCTTG TCGACTTATA CCCGGTTTGT 



101 


CCTATAGACA 


CCATTCGTCA 


AGGACGGGGC 


GCTCAGGGCC 
CGAGTCCCGG 


AAGAACAGAT 
TTCTTGTCTA 


151 

4. •J .L 


n t p r* cc a r; zi 
CCAGGGGTCT 


ACGCCAGGTC 


GGGAGTCGTC 


TTTCTAGAGA 
AAAGATCTCT 


ACCATCAGAT 
TGGTAGTCTA 


201 


will LLnubu 

CAAAGGTCCC 


ACGGGGTTCC 


TGGACTTTAC 


ACCCTGTGCC 
TGGGACACGG 


TTATTTGAAC 
AATAAACTTG 


251 


TAACCAATCA 
ATTGGTTAGT 


GTTCGCTTCT 
CAAGCGAAGA 


CGCTTCTGTT 
GCGAAGACAA 


CGCGCGCTTC 
GCGCGCGAAG 


TGCTCCCCGA 
ACGAGGGGCT 




GCTCAATAAA 
CGAGTTATTT 


AGAGCCCACA 
TCTCGGGTGT 


ACCCCTCACT 
TGGGGAGTGA 


CGGGGCGCCA 
GCCCCGCGGT 


GTCCTCCGAT 
CAGGAGGCTA 


W 1 


TGACTGAGTC 
ACTGACTCAG 


GCCCGGGTAC 
CGGGCCCATG 


CCGTGTATCC 
GGCACATAGG 


AATAAACCCT 
TTATTTGGGA 


CTTGCAGTTG 
GAACGTCAAC 



fgl CATCCGACTT GTGGTCTCGC TGTTCCTTGG GAGGGTCTCC TCTGAGTGAT 
GTAGGCTGAA CACCAGAGCG ACAAGGAACC CTCCCAGAGG AGACTCACTA 



^ MBI — — m m j — — ^ j _ 

|51 TGACTACCCG TCAGCGGGGG TCTTTCATTT GGGGGCTCGT CCGGGATCGG 
W ACTGATGGGC AGTCGCCCCC AGAAAGTAAA CCCCCGAGCA GGCCCTAGCC 



5#1 GAGACCCCTG CCCAGGGACC ACCGACCCAC CACCGGGAGG CAAGCTGGCC 
yl CTCTGGGGAC GGGTCCCTGG TGGCTGGGTG GTGGCCCTCC GTTCGACCGG 



*i 


AGCAACTTAT 
TCGTTGAATA 


CTGTGTCTGT 
GACACAGACA 


CCGATTGTCT 
GGCTAACAGA 


AGTGTCTATG 
TC AC AG AT AC 


ACTGATTTTA 
TGACTAAAAT 


601 


TGCGCCTGCG 
ACGCGGACGC 


TCGGTACTAG 
AGCCATGATC 


TTAGCTAACT 
AATCGATTGA 


AGCTCTGTAT 
TCGAGACATA 


CTGGCGGACC 
GACCGCCTGG 


651 


CGTGGTGGAA 
GCACCACCTT 


CTGACGAGTT 
GACTGCTCAA 


CTGAACACCC 
GACTTGTGGG 


GGCCGCAACC 
CCGGCGTTGG 


CTGGGAGACG 
GACCCTCTGC 


701 


TCCCAGGGAC 
AGGGTCCCTG 


TTTGGGGGCC 
AAACCCCCGG 


GTTTTTGTGG 
CAAAAACACC 


CCCGACCTGA 
GGGCTGGACT 


GGAAGGGAGT 
CCTTCCCTCA 


751 


CGATGTGGAA 
GCTACACCTT 


TCCGACCCCG 
AGGCTGGGGC 


TCAGGATATG 
AGTCCTATAC 


TGGTTCTGGT 
ACCAAGACCA 


AGGAGACGAG 
TCCTCTGCTC 


801 


AACCTAAAAC 
TTGGATTTTG 


AGTTCCCGCC 
TCAAGGGCGG 


TCCGTCTGAA 
AGGCAGACTT 


TTTTTGCTTT 
AAAAACGAAA 


CGGTTTGGAA 
GCCAAACCTT 


851 


CCGAAGCCGC 
GGCTTCGGCG 


GCGTCTTGTC 
CGCAGAACAG 


TGCTGCAGCA 
ACGACGTCGT 


TCGTTCTGTG 
AGCAAGACAC 


TTGTCTCTGT 
AACAGAGACA 


901 


CTGACTGTGT 
GACTGACACA 


TTCTGTATTT 
AAGACATAAA 


GTCTGAAAAT 
CAGACTTTTA 


TAGGGCCAGA 
ATCCCGGTCT 


CTGTTACCAC 
GACAATGGTG 



FIGURE 11B 



951 


TCCCTTAAGT 
AGGGAATTCA 


TTGACCTTAG 
AACTGGAATC 


GTAACTGGAA 
CATTGACCTT 


AGATGTCGAG 
TCTACAGCTC 


CGGCTCGCTC 
GCCGAGCGAG 


1001 


ACAACCAGTC 
TGTTGGTCAG 


GGTAGATGTC 
CCATCTACAG 


AAGAAGAGAC 
TTCTTCTCTG 


GTTGGGTTAC 
CAACCCAATG 


CTTCTGCTCT 
GAAGACGAGA 


1051 


GCAGAATGGC 
CGTCTTACCG 


CAACCTTTAA 
GTTGGAAATT 


CGTCGGATGG 
GCAGCCTACC 


CCGCGAGACG 
GGCGCTCTGC 


GCACCTTTAA 
CGTGGAAATT 


1101 


CCGAGACCTC 
GGCTCTGGAG 


ATCACCCAGG 
TAGTGGGTCC 


TTAAGATCAA 
AATTCTAGTT 


GGTCTTTTCA 
CCAGAAAAGT 


CCTGGCCCGC 
GGACCGGGCG 


1151 


ATGGACACCC 
TACCTGTGGG 


AGACCAGGTC 
TCTGGTCCAG 


CCCTACATCG 
GGGATGTAGC 


TGACCTGGGA 
ACTGGACCCT 


AGCCTTGGCT 
TCGGAACCGA 


1201 


TTTGACCCCC 
AAACTGGGGG 


CTCCCTGGGT 
GAGGGACCCA 


CAAGCCCTTT 
GTTCGGGAAA 


GTACACCCTA 
CATGTGGGAT 


AGCCTCCGCC 
TCGGAGGCGG 


lSl 

u s 


TCCTCTTCCT 
AGGAGAAGGA 


CCATCCGCCC 
GGTAGGCGGG 


CGTCTCTCCC 
GCAGAGAGGG 


CCTTGAACCT 
GGAACTTGGA 


CCTCGTTCGA 
GGAGCAAGCT 


lifoi 


CCCCGCCTCG 
GGGGCGGAGC 


ATCCTCCCTT 
TAGGAGGGAA 


TATCCAGCCC 
ATAGGTCGGG 


TCACTCCTTC 
AGTGAGGAAG 


TCTAGGCGCC 
AGATCCGCGG 




GGCCGCTCTA 
CCGGCGAGAT 


GCCCATTAAT 
CGGGTAATTA 


ACGACTCACT 
TGCTGAGTGA 


ATAGGGCGAT 
TATCCCGCTA 


TCGAACACCA 
AGCTTGTGGT 


iioi 


TGCACCATCA 
ACGTGGTAGT 


TCATCATCAC 
AGTAGTAGTG 


GTCGACTATA 
CAGCTGATAT 


AAGATGAGGA 
TTCTACTCCT 


CCTCGAGATG 
GGAGCTCTAC 


X - 


GGCGTGATTA 
CCGCACTAAT 


CGGATTCACT 
GCCTAAGTGA 


GGCCGTCGTG 
CCGGCAGCAC 


GCCCGCACCG 
CGGGCGTGGC 


ATCGCCCTTC 
TAGCGGGAAG 




CCAACAGTTA 
GGTTGTCAAT 


CGCAGCCTGA 
GCGTCGGACT 


ATGGCGAATG 
TACCGCTTAC 


GCGCTTTGCC 
CGCGAAACGG 


TGGTTTCCGG 
ACCAAAGGCC 


1551 


CACCAGAAGC 
GTGGTCTTCG 


GGTGCCGGAA 
CCACGGCCTT 


AGCTGGCTGG 
TCGACCGACC 


AGTGCGATCT 
TCACGCTAGA 


TCCTGAGGCC 
AGGACTCCGG 


1601 


GATACTGTCG 
CTATGACAGC 


TCGTCCCCTC 
AGCAGGGGAG 


AAACTGGCAG 
TTTGACCGTC 


ATGCACGGTT 
TACGTGCCAA 


ACGATGCGCC 
TGCTACGCGG 


1651 


CATCTACACC 
GTAGATGTGG 


AACGTGACCT 
TTGCACTGGA 


ATCCCATTAC 
TAGGGTAATG 


GGTCAATCCG 
CCAGTTAGGC 


CCGTTTGTTC 
GGCAAACAAG 


1701 


CCACGGAGAA 
GGTGCCTCTT 


TCCGACGGGT 
AGGCTGCCCA 


TGTTACTCGC 
ACAATGAGCG 


TCACATTTAA 
AGTGTAAATT 


TGTTGATGAA 
ACAACTACTT 


1751 


AGCTGGCTAC 
TCGACCGATG 


AGGAAGGCCA 
TCCTTCCGGT 


GACGCGAATT 
CTGCGCTTAA 


ATTTTTGATG 
TAAAAACTAC 


GCGTTAACTC 
CGCAATTGAG 


1801 


GGCGTTTCAT 
CCGCAAAGTA 


CTGTGGTGCA 
GACACCACGT 


ACGGGCGCTG 
TGCCCGCGAC 


GGTCGGTTAC 
CCAGCCAATG 


GGCCAGGACA 
CCGGTCCTGT 


1851 


GTCGTTTGCC 
CAGCAAACGG 


GTCTGAATTT 
CAGACTTAAA 


GACCTGAGCG 
CTGGACTCGC 


CATTTTTACG 
GTAAAAATGC 


CGCCGGAGAA 
GCGGCCTCTT 



1001 AACCGCCTCG CGGTGATGGT GGTGCGCTGG AGTGACGGCA GTTATCTGGA 
TTGGCGGAGC GCCACTACCA CGACf^GACC TCACTGCCGT CAATAGACCT 



1951 


AGATCAGGAT 
TCTAGTCCTA 


ATGTGGCGGA 
TACACCGCCT 


TGAGCGGCAT 
ACTCGCCGTA 


TTTCCGTGAC 
AAAGGCACTG 


GTCTCGTTGC 
CAGAGCAACG 


2001 


TGCATAAACC 
ACGTATTTGG 


GACTACACAA 
CTGATGTGTT 


ATCAGCGATT 
TAGTCGCTAA 


TCCATGTTGC 
AGGTACAACG 


CACTCGCTTT 
GTGAGCGAAA 



2051 AATGATGATT TCAGCCGCGC TGTACTGGAG GCTGAAGTTC AGATGTGCGG 
TTACTACTAA AGTCGGCGCG ACATGACCTC CGACTTCAAG TCTACACGCC 



2101 


CGAGTTGCGT 
GCTCAACGCA 


GACTACCTAC 
CTGATGGATG 


GGGTAACAGT 
CCCATTGTCA 


TTCTTTATGG 
AAGAAATACC 


CAGGGTGAAA 
GTCCCACTTT 


2151 


CGCAGGTCGC 
GCGTCCAGCG 


CAGCGGCACC 
GTCGCCGTGG 


GCGCCTTTCG 
CGCGGAAAGC 


GCGGTGAAAT 
CGCCACTTTA 


TATCGATGAG 
ATAGCTACTC 


22^1 


CGTGGTGGTT 
GCACCACCAA 


ATGCCGATCG 
TACGGCTAGC 


CGTCACACTA 
GCAGTGTGAT 


CGTCTGAACG 
GCAGACTTGC 


TCGAAAACCC 
AGCTTTTGGG 


2&51 

leas? 
1 z 


GAAACTGTGG 
CTTTGACACC 


AGCGCCGAAA 
TCGCGGCTTT 


TCCCGAATCT 
AGGGCTTAGA 


CTATCGTGCG 
GATAGCACGC 


GTGGTTGAAC 
CACCAACTTG 


* 

2i?i 

a 


TGCACACCGC 
ACGTGTGGCG 


CGACGGCACG 
GCTGCCGTGC 


CTGATTGAAG 
GACTAACTTC 


CAGAAGCCTG 
GTCTTCGGAC 


CGATGTCGGT 
GCTACAGCCA 


2M1 

s'"" ; " 


TTCCGCGAGG 
AAGGCGCTCC 


TGCGGATTGA 
ACGCCTAACT 


AAATGGTCTG 
TTTACCAGAC 


CTGCTGCTGA 
GACGACGACT 


ACGGCAAGCC 
TGCCGTTCGG 


24.01 


GTTGCTGATT 
CAACGACTAA 


CGAGGCGTTA 
GCTCCGCAAT 


ACCGTCACGA 
TGGCAGTGCT 


GCATCATCCT 
CGTAGTAGGA 


CTGCATGGTC 
GACGTACCAG ' 


2&31 


AGGTCATGGA 
TCCAGTACCT 


TGAGCAGACG 
ACTCGTCTGC 


ATGGTGCAGG 
TACCACGTCC 


ATATCCTGCT 
TATAGGACGA 

* 


GATGAAGCAG 
CTACTTCGTC 


2501 


AACAACTTTA 
TTGTTGAAAT 


ACGCCGTGCG 
TGCGGCACGC 


CTGTTCGCAT 
GACAAGCGTA 


TATCCGAACC 
ATAGGCTTGG 


ATCCGCTGTG 
TAGGCGACAC 


2551 


GTACACGCTG 
CATGTGCGAC 


TGCGACCGCT 
ACGCTGGCGA 


ACGGCCTGTA 
TGCCGGACAT 


TGTGGTGGAT 
ACACCACCTA 


GAAGCCAATA 
CTTCGGTTAT 


2601 


TTGAAACCCA 
AACTTTGGGT 


CGGCATGGTG 
GCCGTACCAC 


CCAATGAATC 
GGTTACTTAG 


GTCTGACCGA 
CAGACTGGCT 


TGATCCGCGC 
ACTAGGCGCG 


2651 


TGGCTACCGG 
ACCGATGGCC 


CGATGAGCGA 
GCTACTCGCT 


ACGCGTAACG 
TGCGCATTGC 


CGAATGGTGC 
GCTTACCACG 


AGCGCGATCG 
TCGCGCTAGC 


2701 


TAATCACCCG 
ATTAGTGGGC 


AGTGTGATCA 
TCACACTAGT 


TCTGGTCGCT 
AGACCAGCGA 


GGGGAATGAA 
CCCCTTACTT 


TCAGGCCACG 
AGTCCGGTGC 



2751 GCGCTAATCA CGACGCGCTG TATCGCTGGA TCAAATCTGT CGATCCTTCC 
CGCGATTAGT GCTGCGCGAC ATAGCGACCT AGTTTAGACA GCTAGGAAGG 



2801 CGCCCGGTGC AGTATGAAGG CGGCGGAGCC GACACCACGG CCACCGATAT 
GCGGGCCACG TCATACTTCC GCCGCCTCGG CTGTGGTGCC GGTGGCTATA 



t 

2351 


TATTTGCCCG 
ATAAACGGGC 


ATGTACGCGC 
TACATGCGCG 


GCGTGGATGA 
CGCACCTACT 


AGACCAGCCC 
TCTGGTCGGG 


TTCCCGGCTG 
AAGGGCCGAC 


2901 


TGCCGAAATG 
ACGGCTTTAC 


GTCCATCAAA 
CAGGTAGTTT 


AAATGGCTTT 
TTTACCGAAA 


CGCTACCTGG 
GCGATGGACC 


AGAGACGCGC 
TCTCTGCGCG 


2951 


CCGCTGATCC 
GGCGACTAGG 


TTTGCGAATA 
AAACGCTTAT 


CGCCCACGCG 
GCGGGTGCGC 


ATGGGTAACA 
TACCCATTGT 


GTCTTGGCGG 
CAGAACCGCC 


3001 


TTTCGCTAAA 
AAAGCGATTT 


TACTGGCAGG 
ATGACCGTCC 


CGTTTCGTCA 
GCAAAGCAGT 


GTATCCCCGT 
CATAGGGGCA 


TTACAGGGCG 
AATGTCCCGC 


3051 


GCTTCGTCTG 
CGAAGCAGAC 


GGACTGGGTG 
CCTGACCCAC 


GATCAGTCGC 
CTAGTCAGCG 


TGATTAAATA 
ACTAATTTAT 


TGATGAAAAC ■ ■ 
ACTACTTTTG 


3101 


GGCAACCCGT 
CCGTTGGGCA 


GGTCGGCTTA 
CCAGCCGAAT 


CGGCGGTGAT 
GCCGCCACTA 


TTTGGCGATA 
AAACCGCTAT 


CGCCGAACGA 
GCGGCTTGCT 


yn 


TCGCCAGTTC 
AGCGGTCAAG 


TGTATGAACG 
ACATACTTGC 


GTCTGGTCTT 
CAGACCAGAA 


TGCCGACCGC 
ACGGCTGGCG 


ACGCCGCATC 
TGCGGCGTAG 


CAGCGCTGAC 
GTCGCGACTG 


GGAAGCAAAA 
CCTTCGTTTT 


CACCAGCAGC 
GTGGTCGTCG 


AGTTTTTCCA 
TCAAAAAGGT 


GTTCCGTTTA 
CAAGGCAAAT 


32S|1 

HI 


TCCGGGCAAA 
AGGCCCGTTT 


CCATCGAAGT 
GGTAGCTTCA 


GACCAGCGAA 
CTGGTCGCTT 


TACCTGTTCC 
ATGGACAAGG 


GTCATAGCGA 
CAGTATCGCT 


3^0,1 


TAACGAGCTC 
ATTGCTCGAG 


CTGCACTGGA 
GACGTGACCT 


TGGTGGCGCT 
ACCACCGCGA 


GGATGGTAAG 
CCTACCATTC 


CCGCTGGCAA 
GGCGACCGTT 


3igi 


GCGGTGAAGT 
CGCCACTTCA 


GCCTCTGGAT 
CGGAGACCTA 


GTCGCTCCAC 
CAGCGAGGTG 


AAGGTAAACA 
TTCCATTTGT 


GTTGATTGAA 
CAACTAACTT 




CTGCCTGAAC 
GACGGACTTG 


TACCGCAGCC 
ATGGCGTCGG 


GGAGAGCGCC 
CCTCTCGCGG 


GGGCAACTCT 
CCCGTTGAGA 


GGCTCACAGT 
CCGAGTGTCA 


3451 


ACGCGTAGTG 
TGCGCATCAC 


CAACCGAACG 
GTTGGCTTGC 


CGACCGCATG 
GCTGGCGTAC 


GTCAGAAGCC 
CAGTCTTCGG 


GGGCACATCA 
CCCGTGTAGT 


3501 


GCGCCTGGCA 
CGCGGACCGT 


GCAGTGGCGT 
CGTCACCGCA 


CTGGCGGAAA 
GACCGCCTTT 


ACCTCAGTGT 
TGGAGTCACA 


GACGCTCCCC 
CTGCGAGGGG 


3551 


GCCGCGTCCC 
CGGCGCAGGG 


ACGCCATCCC 
TGCGGTAGGG 


GCATCTGACC 
CGTAGACTGG 


ACCAGCGAAA 
TGGTCGCTTT 


TGGATTTTTG 
ACCTAAAAAC 


3601 


CATCGAGCTG 
GTAGCTCGAC 


GGTAATAAGC 
CCATTATTCG 


GTTGGCAATT 
CAACCGTTAA 


TAACCGCCAG 
ATTGGCGGTC 


TCAGGCTTTC 
AGTCCGAAAG 


3651 


TTTCACAGAT 
AAAGTGTCTA 


GTGGATTGGC 
CACCTAACCG 


GATAAAAAAC 
CTATTTTTTG 


AACTGCTGAC 
TTGACGACTG 


GCCGCTGCGC 
CGGCGACGCG 


3701 


GATCAGTTCA 
CTAGTCAAGT 


CCCGTGCACC 
GGGCACGTGG 


GCTGGATAAC 
CGACCTATTG 


GACATTGGCG 
CTGTAACCGC 


TAAGTGAAGC 
ATTCACTTCG 


3751 


GACCCGCATT 
CTGGGCGTAA 


GACCCTAACG 
CTGGGATTGC 


CCTGGGTCGA 
GGACCCAGCT 


ACGCTGGAAG 
TGCGACCTTC 


GCGGCGGGCC 
CGCCGCCCGG 



< 
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3801 


ATTACCAGGC 
TAATGGTCCG 


CGAAGCAGCG 
GCTTCGTCGC 


TTGTTGCAGT 
AACAACGTCA 


GCACGGCAGA 
CGTGCCGTCT 


TACACTTGCT 
ATGTGAACGA 


3851 


GATGCGGTGC 
CTACGCCACG 


TGATTACGAC 
ACTAATGCTG 


CGCTCACGCG 
GCGAGTGCGC 


TGGCAGCATC 
ACCGTCGTAG 


AGGGGAAAAC 
TCCCCTTTTG 


3901 


CTTATTTATC 
GAATAAATAG 


AGCCGGAAAA 
TCGGCCTTTT 


CCTACCGGAT 
GGATGGCCTA 


TGATGGTAGT 
ACTACCATCA 


GGTCAAATGG 
CCAGTTTACC 


3951 


CGATTACCGT 
GCTAATGGCA 


TGATGTTGAA 
ACTACAACTT 


GTGGCGAGCG 
CACCGCTCGC 


ATACACCGCA 
TATGTGGCGT 


TCCGGCGCGG 
AGGCCGCGCC 


4001 


ATTGGCCTGA 
' TAACCGGACT 


ACTGCCAGCT 
TGACGGTCGA 


GGCGCAGGTA 
CCGCGTCCAT 


GCAGAGCGGG 
CGTCTCGCCC 


TAAACTGGCT 
ATTTGACCGA 


4051 


CGGATTAGGG 
GCCTAATCCC 


CCGCAAGAAA 
GGCGTTCTTT 


ACTATCCCGA 
TGATAGGGCT 


CCGCCTTACT 
GGCGGAATGA 


GCCGCCTGTT 
CGGCGGACAA 


4©1 

" i " 


TTGACCGCTG 
AACTGGCGAC 


GGATCTGCCA 
CCTAGACGGT 


TTGTCAGACA 
AACAGTCTGT 


TGTATACCCC 
ACATATGGGG 


GTACGTCTTC 
CATGCAGAAG 


41151 

~15. 


CCGAGCGAAA 
GGCTCGCTTT 


ACGGTCTGCG 
TGCCAGACGC 


CTGCGGGACG 
GACGCCCTGC 


CGCGAATTGA 
GCGCTTAACT 


ATTATGGCCC 
TAATACCGGG 


42£1 

t a : 
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ACACCAGTGG 
TGTGGTCACC 


CGCGGCGACT 
GCGCCGCTGA 


TCCAGTTCAA 
AGGTCAAGTT 


CATCAGCCGC 
GTAGTCGGCG 


TACAGTCAAC 
ATGTCAGTTG 


4 |31 


AGCAACTGAT 
TCGTTGACTA 


GGAAACCAGC 
CCTTTGGTCG 


CATCGCCATC 
GTAGCGGTAG 


TGCTGCACGC 
ACGACGTGCG 


GGAAGAAGGC 
CCTTCTTCCG 


4 3#1 


ACATGGCTGA 
TGTACCGACT 


ATATCGACGG 
TATAGCTGCC 


TTTCCATATG 
AAAGGTATAC 


GGGATTGGTG 
CCCTAACCAC 


GCGACGACTC 
CGCTGCTGAG 


43.5=1 


CTGGAGCCCG 
GACCTCGGGC 


TCAGTATCGG 
AGTCATAGCG 


CGGAATTCCA 
GCCTTAAGGT 


GCTGAGCGCC 
CGACTCGCGG 


GGTCGCTACC 
CCAGCGATGG 


4401 


ATTACCAGTT 
TAATGGTCAA 


GGTCTGGTGT 
CCAGACCACA 


CAAAAAAGAT 
GTTTTTTCTA 


CTGGAGGTGG 
GACCTCCACC 


TGGCAGCAGG 
ACCGTCGTCC 


4451 


CCTTGGCGCG 
GGAACCGCGC 


CCGGATCCTT 
GGCCTAGGAA 


AATTAACAAT 
TTAATTGTTA 


TGACCGGTAA 
ACTGGCCATT 


TAATAGGTAG 
ATTATCCATC 


4501 


ATAAGTGACT 
TATTCACTGA 


GATTAGATGC 
CTAATCTACG 


ATTGATCCCT 
TAACTAGGGA 


CGACCAATTC 
GCTGGTTAAG 


CGGTTATTTT 
GCCAATAAAA 


4551 


CCACCATATT 
GGTGGTATAA 


GCCGTCTTTT 
CGGCAGAAAA 


GGCAATGTGA 
CCGTTACACT 


GGGCCCGGAA 
CCCGGGCCTT 


ACCTGGCCCT 
TGGACCGGGA 


4601 


GTCTTCTTGA 
CAGAAGAACT 


CGAGCATTCC 
GCTCGTAAGG 


TAGGGGTCTT 
ATCCCCAGAA 


TCCCCTCTCG 
AGGGGAGAGC 


CCAAAGGAAT 
GGTTTCCTTA 


4651 


GCAAGGTCTG 
CGTTCCAGAC 


TTGAATGTCG 
AACTTACAGC 


TGAAGGAAGC 
ACTTCCTTCG 


AGTTCCTCTG 
TCAAGGAGAC 


GAAGCTTCTT 
CTTCGAAGAA 


4701 


GAAGACAAAC 
CTTCTGTTTG 


AACGTCTGTA 
TTGCAGACAT 


GCGACCCTTT 
CGCTGGGAAA 


GCAGGCAGCG 
CGTCCGTCGC 


GAACCCCCCA 
CTTGGGGGGT 



4 751 CCTGGCGACA GGTGCCTCTG CGGCCAAAAG CCACGTGTAT AAGATACACC 
GGACCGCTGT CCACGGAGAC GCCGGTTTTC GGTGCACATA TTCTATGTGG 



4801 


TGCAAAGGCG 
ACGTTTCCGC 


GCACAACCCC 
CGTGTTGGGG 


AGTGCCACGT 
TCACGGTGCA 


TGTGAGTTGG 
ACACTCAACC 


ATAGTTGTGG 
TATCAACACC 


4851 


AAAGAGTCAA 
TTTCTCAGTT 


ATGGCTCTCC 
TACCGAGAGG 


TCAAGCGTAT 
AGTTCGCATA 


TCAACAAGGG 
AGTTGTTCCC 


GCTGAAGGAT 
CGACTTCCTA 


4901 


GCCCAGAAGG 
CGGGTCTTCC 


TACCCCATTG 
ATGGGGTAAC 


TATGGGATCT 
ATACCCTAGA 


GATCTGGGGC 
CTAGACCCCG 


CTCGGTGCAC 
GAGCCACGTG 


4951 


ATGCTTTACA 
TACGAAATGT 


TGTGTTTAGT 
ACACAAATCA 


CGAGGTTAAA 
GCTCCAATTT 


AAACGTCTAG 
TTTGCAGATC 


GCCCCCCGAA 
CGGGGGGCTT 


5001 

V- 3 


CCACGGGGAC 
GGTGCCCCTG 


GTGGTTTTCC 
CACCAAAAGG 


TTTGAAAAAC 
AAACTTTTTG 


ACGATGATAA 
TGCTACTATT 


TACCATGATT 
ATGGTACTAA 


GAACAAGATG 
CTTGTTCTAC 


GATTGCACGC 
CTAACGTGCG 


AGGTTCTCCG 
TCCAAGAGGC 


GCCGCTTGGG 
CGGCGAACCC 


TGGAGAGGCT 
ACCTCTCCGA 


5LQ1 


ATTCGGCTAT 
TAAGCCGATA 


GACT.GGGCAC 
CTGACCCGTG 


AACAGACAAT 
TTGTCTGTTA 


CGGCTGCTCT 
GCCGACGAGA 


GATGCCGCCG 
CTACGGCGGC 


* ** 

5151 

t = : 


TGTTCCGGCT 
ACAAGGCCGA 


GTCAGCGCAG 
CAGTCGCGTC 


GGGCGCCCGG 
CCCGCGGGCC 


TTCTTTTTGT 
AAGAAAAACA 


CAAGACCGAC 
GTTCTGGCTG 


s|ii 


CTGTCCGGTG 
GACAGGCCAC 


CCCTGAATGA 
GGGACTTACT 


ACTGCAGGAC 
TGACGTCCTG 


GAGGCAGCGC 
CTCCGTCGCG 


GGCTATCGTG 
CCGATAGCAC 


52pl 

: a 


GCTGGCCACG 
CGACCGGTGC 


ACGGGCGTTC 
TGCCCGCAAG 


CTTGCGCAGC 
GAACGCGTCG 


TGTGCTCGAC 
ACACGAGCTG 


GTTGTCACTG 
CAACAGTGAC 


5301 


AAGCGGGAAG 
TTCGCCCTTC 


GGACTGGCTG 
CCTGACCGAC 


CTATTGGGCG 
GATAACCCGC 


AAGTGCCGGG 
TTCACGGCCC 


GCAGGATCTC 
CGTCCTAGAG 


5351 


CTGTCATCTC 
G AC AG TAG AG 


ACCTTGCTCC 
TGGAACGAGG 


TGCCGAGAAA 
ACGGCTCTTT 


GTATCCATCA 
CATAGGTAGT 


TGGCTGATGC 
ACCGACTACG 


5401 


AATGCGGCGG 
TTACGCCGCC 


CTGCATACGC 
GACGTATGCG 


TTGATCCGGC 
AACTAGGCCG 


TACCTGCCCA 
ATGGACGGGT 


TTCGACCACC 
AAGCTGGTGG 


5451 


AAGCGAAACA 
TTCGCTTTGT 


TCGCATCGAG 
AGCGTAGCTC 


CGAGCACGTA 
GCTCGTGCAT 


CTCGGATGGA 
GAGCCTACCT 


AGCCGGTCTT 
TCGGCCAGAA 


5501 


GTCGATCAGG 
CAGCTAGTCC 


ATGATCTGGA 
TACTAGACCT 


CGAAGAGCAT 
GCTTCTCGTA 


CAGGGGCTCG 
GTCCCCGAGC 


CGCCAGCCGA 
GCGGTCGGCT 


5551 


ACTGTTCGCC 
TGACAAGCGG 


AGGCTCAAGG 
TCCGAGTTCC 


CGCGCATGCC 
GCGCGTACGG 


CGACGGCGAG 
GCTGCCGCTC 


GATCTCGTCG 
CTAGAGCAGC 


5601 


TGACCCATGG 
ACTGGGTACC 


CGATGCCTGC 
GCTACGGACG 


TTGCCGAATA 
AACGGCTTAT 


TCATGGTGGA 
AGTACCACCT 


AAATGGCCGC 
TTTACCGGCG 


5651 


TTTTCTGGAT 
AAAAGACCTA 


TCATCGACTG 
AGTAGCTGAC 


TGGCCGGCTG 
ACCGGCCGAC 


GGTGTGGCGG 
CCACACCGCC 


ACCGCTATCA 
TGGCGATAGT 



1 
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57 01 GGACATAGCG TTGGCTACCC GTGATATTGC TGAAGAGCTT GGCGGCGAAT 
CCTGTATCGC AACCGATGGG CACTATAACG ACTTCTCGAA CCGCCGCTTA 



5751 


GGGCTGACCG 
CCCGACTGGC 


CTTCCTCGTG 
GAAGGAGCAC 


CTTTACGGTA 
GAAATGCCAT 


TCGCCGCTCC 
AGCGGCGAGG 


CGATTCGCAG 
GCTAAGCGTC 


5801 


CGCATCGCCT 
GCGTAGCGGA 


TCTATCGCCT 
AGATAGCGGA 


TCTTGACGAG 
AGAACTGCTC 


TTCTTCTGAG 
AAGAAGACTC 


CGGGACTCTG 
GCCCTGAGAC 


5851 


GGGTTCGCAT 
CCCAAGCGTA 


CGATAAAATA 
GCTATTTTAT 


AAAGATTTTA 
TTTCTAAAAT 


TTTAGTCTCC 
AAATCAGAGG 


AGAAAAAGGG 
TCTTTTTCCC 


5901 


GGGAATGAAA 
CCCTTACTTT 


GACCCCACCT 
CTGGGGTGGA 


GTAGGTTTGG 
CATCCAAACC 


CAAGCTAGCT 
GTTCGATCGA 


TAAGTAACGC 
ATTCATTGCG 


5951 


CATTTTGCAA 
GTAAAACGTT 


GGCATGGAAA 
CCGTACCTTT 


AATACATAAC 
TTATGTATTG 


TGAGAATAGA 
ACTCTTATCT 


GAAGTTCAGA 
CTTCAAGTCT 


6Qgl 

* z s 


TCAAGGTCAG 
AGTTCCAGTC 


GAACAGATGG 
CTTGTCTACC 


AACAGCTGAA 
TTGTCGACTT 


TATGGGCCAA 
ATACCCGGTT 


ACAGGATATC 
TGTCCTATAG 


6b T 5l 


TGTGGTAAGC 
ACACCATTCG 


AGTTCCTGCC 
TCAAGGACGG 


CCGGCTCAGG 
GGCCGAGTCC 


GCCAAGAACA 
CGGTTCTTGT 


GATGGAACAG 
CTACCTTGTC 


6 1|91 
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CTGAATATGG 
GACTTATACC 


GCCAAACAGG 
CGGTTTGTCC 


ATATCTGTGG 
TATAGACACC 


TAAGCAGTTC 
ATTCGTCAAG 


CTGCCCCGGC 
GACGGGGCCG 


6|r^ 1 


TCAGGGCCAA 
AGTCCCGGTT 


GAACAGATGG 
CTTGTCTACC 


TCCCCAGATG 
AGGGGTCTAC 


CGGTCCAGCC 
GCCAGGTCGG 


CTCAGCAGTT 
GAGTCGTCAA 


V 3 

62D1 


TCTAGAGAAC 
AGATCTCTTG 


CATCAGATGT 
GTAGTCTACA 


TTCCAGGGTG 
AAGGTCCCAC 


CCCCAAGGAC 
GGGGTTCCTG 


CTGAAATGAC 
GACTTTACTG 


a - 

6 441 


CCTGTGCCTT 
GGACACGGAA 


ATTTGAACTA 
TAAACTTGAT 


ACCAATCAGT 
TGGTTAGTCA 


TCGCTTCTCG 
AGCGAAGAGC 


CTTCTGTTCG 
GAAGACAAGC 


6301 


CGCGCTTCTG 
GCGCGAAGAC 


CTCCCCGAGC 
GAGGGGCTCG 


TCAATAAAAG 
AGTTATTTTC 


AGCCCACAAC 
TCGGGTGTTG 


CCCTCACTCG 
GGGAGTGAGC 


6351 


GGGCGCCAGT 
CCCGCGGTCA 


CCTCCGATTG 
GGAGGCTAAC 


ACTGAGTCGC 
TGACTCAGCG 


CCGGGTACCC 
GGCCCATGGG 


GTGTATCCAA 
CACATAGGTT 


6401 


TAAACCCTCT 
ATTTGGGAGA 


TGCAGTTGCA 
ACGTCAACGT 


TCCGACTTGT 
AGGCTGAACA 


GGTCTCGCTG 
CCAGAGCGAC 


TTCCTTGGGA 
AAGGAACCCT 


6451 


GGGTCTCCTC 
CCCAGAGGAG 


TGAGTGATTG 
ACTCACTAAC 


ACTACCCGTC 
TGATGGGCAG 


AGCGGGGGTC 
TCGCCCCCAG 


TTTCATTCAT 
AAAGTAAGTA 


6501 


GCAGCATGTA 
CGTCGTACAT 


TCAAAATTAA 
AGTTTTAATT 


TTTGGTTTTT 
AAACCAAAAA 


TTTCTTAAGT 
AAAGAATTCA 


AT TT AC ATT A 
TAAATGTAAT 


6551 


AATGGCCATA 
TTACCGGTAT 


GTTGCATTAA 
CAACGTAATT 


TGAATCGGCC 
ACTTAGCCGG 


AACGCGCGGG 
TTGCGCGCCC 


GAGAGGCGGT 
CTCTCCGCCA 



6601 TTGCGTATTG GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG 
AACGCATAAC CGCGAGAAGG CGAAGGAGCG AGTGACTGAG CGACGCGAGC 




6651 


GTCGTTCGGC 
CAGCAAGCCG 


TGCGGCGAGC 
ACGCCGCTCG 


GGTATCAGCT 
CCATAGTCGA 


CACTCAAAGG 
GTGAGTTTCC 


CGGTAATACG 
GCCATTATGC 


6701 


GTTATCCACA 
CAATAGGTGT 


GAATCAGGGG 
CTTAGTCCCC 


ATAACGCAGG 
TATTGCGTCC 


AAAGAACATG 
TTTCTTGTAC 


TGAGCAAAAG 
ACTCGTTTTC 


6751 


GCCAGCAAAA 
CGGTCGTTTT 


GGCCAGGAAC 
CCGGTCCTTG 


CGTAAAAAGG 
GCATTTTTCC 


CCGCGTTGCT 
GGCGCAACGA 


GGCGTTTTTC 
CCGCAAAAAG 


6801 


CATAGGCTCC 
GTATCCGAGG 


GCCCCCCTGA 
CGGGGGGACT 


CGAGCATCAC 
GCTCGTAGTG 


AAAAATCGAC 
TTTTTAGCTG 


GCTCAAGTCA 
CGAGTTCAGT 


6851 


GAGGTGGCGA 
CTCCACCGCT 


AACCCGACAG 
TTGGGCTGTC 


GACTATAAAG 
CTGATATTTC 


ATACCAGGCG 
TATGGTCCGC 


TTTCCCCCTG 
AAAGGGGGAC 


6901 


GAAGCTCCCT 
CTTCGAGGGA 


CGTGCGCTCT 
GCACGCGAGA 


CCTGTTCCGA 
GGACAAGGCT 


CCCTGCCGCT 
GGGACGGCGA 


TACCGGATAC 
ATGGCCTATG 


*— ~ 
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CTGTCCGCCT 
GACAGGCGGA 


TTCTCCCTTC 
AAGAGGGAAG 


GGGAAGCGTG 
CCCTTCGCAC 


GCGCTTTCTC 
CGCGAAAGAG 


ATAGCTCACG 
TATCGAGTGC 


7^di 


CTGTAGGTAT 
GACATCCATA 


CTCAGTTCGG 
GAGTCAAGCC 


TGTAGGTCGT 
ACATCCAGCA 


TCGCTCCAAG 
AGCGAGGTTC 


CTGGGCTGTG 
GACCCGACAC 


70iSi 


TGCACGAACC 
ACGTGCTTGG 


CCCCGTTCAG 
GGGGCAAGTC 


CCCGACCGCT 
GGGCTGGCGA 


GCGCCTTATC 
CGCGGAATAG 


CGGTAACTAT 
GCCATTGATA 


7 ES 1 

-3 


CGTCTTGAGT 
GCAGAACTCA 


CCAACCCGGT 
GGTTGGGCCA 


AAGACACGAC 
TTCTGTGCTG 


TTATCGCCAC 
AATAGCGGTG 


TGGCAGCAGC 
ACCGTCGTCG 


7^,1 

is; 
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CACTGGTAAC 

GTGACCATTG 


AGGATTAGCA 
TCCTAATCGT 


GAGCGAGGTA 
CTCGCTCCAT 


TGTAGGCGGT 
ACATCCGCCA 


GCTACAGAGT 
CGATGTCTCA 


72m 


TCTTGAAGTG 
AGAACTTCAC 


GTGGCCTAAC 
CACCGGATTG 


TACGGCTACA 
ATGCCGATGT 


CTAGAAGAAC 
GATCTTCTTG 


AGTATTTGGT 
TCATAAACCA 


7251 


ATCTGCGCTC 
TAGACGCGAG 


TGCTGAAGCC 
ACGACTTCGG 


AGTTACCTTC 
TCAATGGAAG 


GGAAAAAGAG 
CCTTTTTCTC 


TTGGTAGCTC 
AACCATCGAG 


7301 


TTGATCCGGC 
AACTAGGCCG 


AAACAAACCA 
TTTGTTTGGT 


CCGCTGGTAG 
GGCGACCATC 


CGGTGGTTTT 
GCCACCAAAA 


TTTGTTTGCA 
AAACAAACGT 


7351 


AGCAGCAGAT 
TCGTCGTCTA 


TACGCGCAGA 
ATGCGCGTCT 


AAAAAAGGAT 
TTTTTTCCTA 


CTCAAGAAGA 
GAGTTCTTCT 


TCCTTTGATC 
AGGAAACTAG 


7401 


TTTTCTACGG 
AAAAGATGCC 


GGTCTGACGC 
CCAGACTGCG 


TCAGTGGAAC 
AGTCACCTTG 


GAAAACTCAC 
CTTTTGAGTG 


GTTAAGGGAT 
CAATTCCCTA 


7451 


TTTGGTCATG 
AAACCAGTAC 


AGATTATCAA 
TCTAATAGTT 


AAAGGATCTT 
TTTCCTAGAA 


CACCTAGATC 
GTGGATCTAG 


CTTTTGCGGC 
GAAAACGCCG 


7501 


CGCAAATCAA 
GCGTTTAGTT 


TCTAAAGTAT 
AGATTTCATA 


ATATGAGTAA 
TATACTCATT 


ACTTGGTCTG 
TGAACCAGAC 


ACAGTTACCA 
TGTCAATGGT 


7551 


ATGCTTAATC 
TACGAATTAG 


AGTGAGGCAC 
TCACTCCGTG 


CTATCTCAGC 
GATAGAGTCG 


GATCTGTCTA 
CTAGACAGAT 


TTTCGTTCAT 
AAAGCAAGTA 



7 601 CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC 
GGTATCAACG GACTGAGGGG CAGCACATCT ATTGATGCTA TGCCCTCCCG 



7 651 TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC 
AATGGTAGAC CGGGGTCACG ACGTTACTAT GGCGCTCTGG GTGCGAGTGG 



7 701 GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA 
CCGAGGTCTA AATAGTCGTT ATTTGGTCGG TCGGCCTTCC CGGCTCGCGT 



7 751 GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 
CTTCACCAGG ACGTTGAAAT AGGCGGAGGT AGGTCAGATA ATTAACAACG 



7 301 CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT 
GCCCTTCGAT CTCATTCATC AAGCGGTCAA TTATCAAACG CGTTGCAACA 



7 851 TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 
ACGGTAACGA TGTCCGTAGC ACCACAGTGC GAGCAGCAAA CCATACCGAA 



pOl CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 
yg GTAAGTCGAG GCCAAGGGTT GCTAGTTCCG CTCAATGTAC TAGGGGGTAC 

-„—* — — — — — — — — — — — _ — -»__. — _ — — — — — ___.—._■.____________________________ 

§151 TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG 
u2 AACACGTTTT TTCGCCAATC GAGGAAGCCA GGAGGCTAGC AACAGTCTTC 

7 — ^ — — — — — — — — — — — — — — — — — ______________________ 

|g01 TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT 
SI ATTCAACCGG CGTCACAATA GTGAGTACCA ATACCGTCGT GACGTATTAA 

5 3 i 



ffi51 CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 
W GAGAATGACA GTACGGTAGG CATTCTACGA AAAGACACTG ACCACTCATG 



£#01 TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG 
01 AGTTGGTTCA GTAAGACTCT TAT C AC AT AC GCCGCTGGCT CAACGAGAAC 



&_*51 CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG 
GGGCCGCAGT TATGCCCTAT TATGGCGCGG TGTATCGTCT TGAAATTTTC 



8201 TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA 
ACGAGTAGTA ACCTTTTGCA AGAAGCCCCG CTTTTGAGAG TTCCTAGAAT 



8251 CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC 
GGCGACAACT CTAGGTCAAG CTACATTGGG TGAGCACGTG GGTTGACTAG 



8301 TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA 
AAGTCGTAGA AAATGAAAGT GGTCGCAAAG ACCCACTCGT TTTTGTCCTT 



8351 GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA 
CCGTTTTACG GCGTTTTTTC CCTTATTCCC GCTGTGCCTT TACAACTTAT 



8401 CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG 
GAGTATGAGA AGGAAAAAGT. TATAATAACT TCGTAAATAG TCCCAATAAC 



8451 TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 
AGAGTACTCG CCTATGTATA AACTTACATA AATCTTTTTA TTTGTTTATC 



8501 GGGTTCCGCG CACATTTC 
CCCAAGGCGC GTGTAAAG 
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Figure 12A 



1 


CTGCAGCCTG 
GACGTCGGAC 


AATATGGGCC 
TTATACCCGG 


AAACAGGATA 
TTTGTCCTAT 


TCTGTGGTAA 
AGACACCATT 


GCAGTTCCTG 
CGTCAAGGAC 


51 


CCCCGGCTCA 
GGGGCCGAGT 


GGGCCAAGAA 
CCCGGTTCTT 


CAGATGGAAC 
GTCTACCTTG 


AGCTGAATAT 
TCGACTTATA 


GGGCCAAACA 
CCCGGTTTGT 


101 


GGATATCTGT 
CCTATAGACA 


GGTAAGCAGT 
CCATTCGTCA 


TCCTGCCCCG 
AGGACGGGGC 


GCTCAGGGCC 
CGAGTCCCGG 


AAGAACAGAT 
TTCTTGTCTA 


151 


GGTCCCCAGA 
CCAGGGGTCT 


TGCGGTCCAG 
ACGCCAGGTC 


CCCTCAGCAG 
GGGAGTCGTC 


TTTCTAGAGA 
AAAGATCTCT 


ACCATCAGAT 
TGGTAGTCTA 


201 


GTTTCCAGGG 
CAAAGGTCCC 


TGCCCCAAGG 
ACGGGGTTCC 


ACCTGAAATG 
TGGACTTTAC 


ACCCTGTGCC 
TGGGACACGG 


TTATTTGAAC 
AATAAACTTG 


251 


TAACCAATCA 
ATTGGTTAGT 


GTTCGCTTCT 
CAAGCGAAGA 


CGCTTCTGTT 
GCGAAGACAA 


CGCGCGCTTC 
GCGCGCGAAG 


TGCTCCCCGA 
ACGAGGGGCT 


If 1 

SJ 


GCTCAATAAA 
CGAGTTATTT 


AGAGCCCACA 
TCTCGGGTGT 


ACCCCTCACT 
TGGGGAGTGA 


CGGGGCGCCA 
GCCCCGCGGT 


GTCCTCCGAT 
CAGGAGGCTA 


5 ™ 
S5S 

151 


TGACTGAGTC 
ACTGACTCAG 


GCCCGGGTAC 
CGGGCCCATG 


CCGTGTATCC 
GGCACATAGG 


AATAAACCCT 
TTATTTGGGA 


CTTGCAGTTG 
GAACGTCAAC 


yii 

fU 


CATCCGACTT 
GTAGGCTGAA 


GTGGTCTCGC 
CACCAGAGCG 


TGTTCCTTGG 
ACAAGGAACC 


GAGGGTCTCC 
CTCCCAGAGG 


TCTGAGTGAT 
AGACTCACTA 




TGACTACCCG 
ACTGATGGGC 


TCAGCGGGGG 
AGTCGCCCCC 


TCTTTCATTT 
AGAAAGTAAA 


GGGGGCTCGT 
CCCCCGAGCA 


CCGGGATCGG 
GGCCCTAGCC 


koi 


GAGACCCCTG 
CTCTGGGGAC 


CCCAGGGACC 
GGGTCCCTGG 


ACCGACCCAC 
TGGCTGGGTG 


CACCGGGAGG 
GTGGCCCTCC 


CAAGCTGGCC 
GTTCGACCGG 


551 


AGCAACTTAT 
TCGTTGAATA 


CTGTGTCTGT 
GACACAGACA 


CCGATTGTCT 
GGCTAACAGA 


AGTGTCTATG 
TC AC AG AT AC 


ACTGATTTTA 
TGACTAAAAT 


601 


TGCGCCTGCG 
ACGCGGACGC 


TCGGTACTAG 
AGCCATGATC 


TTAGCTAACT 
AATCGATTGA 


AGCTCTGTAT 
TCGAGACATA 


CTGGCGGACC 
GACCGCCTGG 


651 


CGTGGTGGAA 
GCACCACCTT 


CTGACGAGTT 
GACTGCTCAA 


CTGAACACCC 
GACTTGTGGG 


GGCCGCAACC 
CCGGCGTTGG 


CTGGGAGACG 
GACCCTCTGC 


701 


TCCCAGGGAC 
AGGGTCCCTG 


TTTGGGGGCC 
AAACCCCCGG 


GTTTTTGTGG 
CAAAAACACC 


CCCGACCTGA 
GGGCTGGACT 


GGAAGGGAGT 
CCTTCCCTCA 


751 


CGATGTGGAA 
GCTACACCTT 


TCCGACCCCG 
AGGCTGGGGC 


TCAGGATATG 
AGTCCTATAC 


TGGTTCTGGT 
ACCAAGACCA 


AGGAGACGAG 
TCCTCTGCTC 


801 


AACCTAAAAC 
TTGGATTTTG 


AGTTCCCGCC 
TCAAGGGCGG 


TCCGTCTGAA 
AGGCAGACTT 


TTTTTGCTTT 
AAAAACGAAA 


CGGTTTGGAA 
GCCAAACCTT 


851 


CCGAAGCCGC 
GGCTTCGGCG 


GCGTCTTGTC 
CGCAGAACAG 


TGCTGCAGCA 
ACGACGTCGT 


TCGTTCTGTG 
AG C AAG AC AC 


TTGTCTCTGT 
AACAGAGACA 


901 


CTGACTGTGT 
G ACT G AC AC A 


TTCTGTATTT 
AAGACATAAA 


GTCTGAAAAT 
CAGACTTTTA 


TAGGGCCAGA 
ATCCCGGTCT 


CTGTTACCAC 
GACAATGGTG 



FIGURE 12B 
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951 TCCCTTAAGT TTGACCTTAG GTAACTGGAA AGATGTCGAG CGGCTCGCTC 
AGGGAATTCA AACTGGAATC CATTGACCTT TCTACAGCTC GCCGAGCGAG 



001 


ACAACCAGTC 
TGTTGGTCAG 


GGTAGATGTC 
CCATCTACAG 


AAGAAGAGAC 
TTCTTCTCTG 


GTTGGGTTAC 
CAACCCAATG 


CTTCTGCTCT 
GAAGACGAGA 


051 


GCAGAATGGC 
CGTCTTACCG 


CAACCTTTAA 
GTTGGAAATT 


CGTCGGATGG 
.GCAGCCTACC 


CCGCGAGACG 
GGCGCTCTGC 


GCACCTTTAA 
CGTGGAAATT 


101 


CCGAGACCTC 
GGCTCTGGAG 


ATCACCCAGG 
TAGTGGGTCC 


TTAAGATCAA 
AATTCTAGTT 


GGTCTTTTCA 
CCAGAAAAGT 


CCTGGCCCGC 
GGACCGGGCG 


151 


ATGGACACCC 
TACCTGTGGG 


AGACCAGGTC 
TCTGGTCCAG 


CCCTACATCG 
GGGATGTAGC 


TGACCTGGGA 
ACTGGACCCT 


AGCCTTGGCT 
TCGGAACCGA 


201 


TTTGACCCCC 
AAACTGGGGG 


CTCCCTGGGT 
GAGGGACCCA 


CAAGCCCTTT 
GTTCGGGAAA 


GTACACCCTA 
CATGTGGGAT 


AGCCTCCGCC 
TCGGAGGCGG 



s ■ : 

12^11 TCCTCTTCCT CCATCCGCCC CGTCTCTCCC CCTTGAACCT CCTCGTTCGA 
AGGAGAAGGA GGTAGGCGGG GCAGAGAGGG GGAACTTGGA GGAGCAAGCT 



1^6iL CCCCGCCTCG ATCCTCCCTT TATCCAGCCC TCACTCCTTC TCTAGGCGCC 
ip GGGGCGGAGC TAGGAGGGAA ATAGGTCGGG AGTGAGGAAG AGATCCGCGG 

. 

13JI GGCCGCTCTA GCCCATTAAT ACGACTCACT ATAGGGCGAT TCGAATCAGG 
fy CCGGCGAGAT CGGGTAATTA TGCTGAGTGA TATCCCGCTA AGCTTAGTCC 

14gjL CCTTGGCGCG CCGGATCCTT AATTAAGCGC AATTGGGAGG TGGCGGTAGC 
j~ GGAACCGCGC GGCCTAGGAA TTAATTCGCG TTAACCCTCC ACCGCCATCG 

14§J CTCGAGATGG GCGTGATTAC GGATTCACTG GCCGTCGTTT TACAACGTCG 
GAGCTCTACC CGCACTAATG CCTAAGTGAC CGGCAGCAAA ATGTTGCAGC 



15bl TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCACATC 
ACTGACCCTT TTGGGACCGC AATGGGTTGA ATTAGCGGAA CGTCGTGTAG 



1551 CCCCTTTCGC CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT 
GGGGAAAGCG GTCGACCGCA TTATCGCTTC TCCGGGCGTG GCTAGCGGGA 



1601 TCCCAACAGT TACGCAGCCT GAATGGCGAA TGGCGCTTTG CCTGGTTTCC 
AGGGTTGTCA ATGCGTCGGA CTTACCGCTT ACCGCGAAAC GGACCAAAGG 



651 


GGCACCAGAA 
CCGTGGTCTT 


GCGGTGCCGG 
CGCCACGGCC 


AAAGCTGGCT 
TTTCGACCGA 


GGAGTGCGAT 
CCTCACGCTA 


CTTCCTGAGG 
GAAGGACTCC 


701 


CCGATACTGT 
GGCTATGACA 


CGTCGTCCCC 
GCAGCAGGGG 


TCAAACTGGC 
AGTTTGACCG 


AGATGCACGG 
TCTACGTGCC 


TTACGATGCG 
AATGCTACGC 


751 


CCCATCTACA 
GGGTAGATGT 


CCAACGTGAC 
GGTTGCACTG 


CTATCCCATT 
GATAGGGTAA 


ACGGTCAATC 
TGCCAGTTAG 


CGCCGTTTGT 
GCGGCAAACA 



18 01 TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT AATGTTGATG 
AGGGTGCCTC TTAGGCTGCC CAACAATGAG CGAGTGTAAA TTACAACTAC 



1851 AAAGCTGGCT ACAGGAAGGC CAGACGCGAA TTATTTTTGA TGGCGTTAAC 
TTTCGACCGA TGTCCTTCCG GTCTGCGCTT AATAAAAACT ACCGCAATTG 



1901 TCGGCGTTTC ATCTGTGGTG CAACGGGCGC TGGGTCGGTT ACGGCCAGGA 
AGCCGCAAAG TAGACACCAC GTTGCCCGCG ACCCAGCCAA TGCCGGTCCT 



951 


CAGTCGTTTG 
GTCAGCAAAC 


CCGTCTGAAT 
GGCAGACTTA 


TTGACCTGAG 
AACTGGACTC 


CGCATTTTTA 
GCGTAAAAAT 


CGCGCCGGAG 
GCGCGGCCTC 


001 


AAAACCGCCT 
TTTTGGCGGA 


CGCGGTGATG 
GCGCCACTAC 


GTGCTGCGCT 
CACGACGCGA 


GGAGTGACGG 
CCTCACTGCC 


CAGTTATCTG 
GTCAATAGAC 


051 


GAAGATCAGG 
CTTCTAGTCC 


ATATGTGGCG 
TATACACCGC 


GATGAGCGGC 
CTACTCGCCG 


ATTTTCCGTG 
TAAAAGGCAC 


ACGTCTCGTT 
TGCAGAGCAA 


101 


GCTGCATAAA 
CGACGTATTT 


CCGACTACAC 
GGCTGATGTG 


AAATCAGCGA 
TTTAGTCGCT 


TTTCCATGTT 
AAAGGTACAA 


GCCACTCGCT 
CGGTGAGCGA 


151 


TTAATGATGA 
AATTACTACT 


TTTCAGCCGC 
AAAGTCGGCG 


GCTGTACTGG 
CGACATGACC 


AGGCTGAAGT 
TCCGACTTCA 


TCAGATGTGC 
AGTCTACACG 


2®. 


GGCGAGTTGC 
CCGCTCAACG 


GTGACTACCT 
CACTGATGGA 


ACGGGTAACA 
TGCCCATTGT 


GTTTCTTTAT 
CAAAGAAATA 


GGCAGGGTGA 
CCGTCCCACT 



22^1 AACGCAGGTC GCCAGCGGCA CCGCGCCTTT CGGCGGTGAA ATTATCGATG 
*0 TTGCGTCCAG CGGTCGCCGT GGCGCGG AAA GCCGCCACTT TAATAGCTAC 



a - _ _ _ — — — — — — — ——— — — — — — — — — — — — — — — — — — ^ — — — — — - 

23P1 AGCGTGGTGG TTATGCCGAT CGCGTCACAC TACGTCTGAA CGTCGAAAAC 
Si TCGCACCACC AATACGGCTA GCGCAGTGTG ATGCAGACTT GCAGCTTTTG 



23%i CCGAAACTGT GGAGCGCCGA AATCCCGAAT CTCTATCGTG CGGTGGTTGA 
H GGCTTTGACA CCTCGCGGCT TTAGGGCTTA GAGATAGCAC GCCACCAACT 



1 z 

2 4 81 ACTGCACACC GCCGACGGCA CGCTGATTGA AGCAGAAGCC TGCGATGTCG 

y_* TGACGTGTGG CGGCTGCCGT GCGACTAACT TCGTCTTCGG ACGCTACAGC 

* ** 

^* tSFjsU* ,— ^ ^ ^™ ^™ ^™ ^ ^* ^™ ^ ^ m 

24 54 GTTTCCGCGA GGTGCGGATT GAAAATGGTC TGCTGCTGCT GAACGGCAAG 

CAAAGGCGCT CCACGCCTAA CTTTTACCAG ACGACGACGA CTTGCCGTTC 



2501 CCGTTGCTGA TTCGAGGCGT TAACCGTCAC GAGCATCATC CTCTGCATGG 
GGCAACGACT AAGCTCCGCA ATTGGCAGTG CTCGTAGTAG GAGACGTACC 



2551 TCAGGTCATG GATGAGCAGA CGATGGTGCA GGATATCCTG CTGATGAAGC 
AGTCCAGTAC CTACTCGTCT GCTACCACGT CCTATAGGAC GACTACTTCG 



601 


AGAACAACTT 
TCTTGTTGAA 


TAACGCCGTG 
ATTGCGGCAC 


CGCTGTTCGC 
GCGACAAGCG 


ATTATCCGAA 
TAATAGGCTT 


CCATCCGCTG 
GGTAGGCGAC 


651 


TGGTACACGC 
ACCATGTGCG 


TGTGCGACCG 
ACACGCTGGC 


CTACGGCCTG 
GATGCCGGAC 


TATGTGGTGG 
ATACACCACC 


ATGAAGCCAA 
TACTTCGGTT 


701 


TATTGAAACC 
ATAACTTTGG 


CACGGCATGG 
GTGCCGTACC 


TGCCAATGAA 
ACGGTTACTT 


TCGTCTGACC 
AGCAGACTGG 


GATGATCCGC 
CTACTAGGCG 


751 


GCTGGCTACC 
CGACCGATGG 


GGCGATGAGC 
CCGCTACTCG 


GAACGCGTAA 
CTTGCGCATT 


CGCGAATGGT 
GCGCT.TACCA 


GCAGCGCGAT 
CGTCGCGCTA 


801 


CGTAATCACC 
GCATTAGTGG 


CGAGTGTGAT 
GCTCACACTA 


CATCTGGTCG 
GTAGACCAGC 


CTGGGGAATG 
GACCCCTTAC 


AATCAGGCCA 
TTAGTGCGGT 



2851 CGGCGCTAAT CACGACGCGC TGTATCGCTG GATCAAATCT GTCGATCCTT 
GCCGCGATTA GTGCTGCGCG ACATAGCGAC CTAGTTTAGA CAGCTAGGAA 



2901 


CCCGCCCGGT 
GGGCGGGCCA 


GCAGTATGAA 
CGTCATACTT 


GGCGGCGGAG 
CCGCCGCCTC 


CCGACACCAC 
GGCTGTGGTG 


GGCCACCGAT 
CCGGTGGCTA 


2951 


ATTATTTGCC 
TAATAAACGG 


CGATGTACGC 
GCTACATGCG 


GCGCGTGGAT 
CGCGCACCTA 


GAAGACCAGC 
CTTCTGGTCG 


CCTTCCCGGC 
GGAAGGGCCG 


3001 


TGTGCCGAAA 
ACACGGCTTT 


TGGTCCATCA 
ACCAGGTAGT 


AAAAATGGCT 
TTTTTACCGA 


TTCGCTACCT 
AAGCGATGGA 


GGAGAGACGC 
CCTCTCTGCG 


3051 


GCCCGCTGAT 
CGGGCGACTA 


CCTTTGCGAA 
GGAAACGCTT 


TACGCCCACG 
ATGCGGGTGC 


CGATGGGTAA 
GCTACCCATT 


CAGTCTTGGC 
GTCAGAACCG 


3101 


GGTTTCGCTA 
CCAAAGCGAT 


AATACTGGCA 
TTATGACCGT 


GGCGTTTCGT 
CCGCAAAGCA 


CAGTATCCCC 
GTCATAGGGG 


GTTTACAGGG 
CAAATGTCCC 


3 ft? 1 

i s 's 


CGGCTTCGTC 
GCCGAAGCAG 


TGGGACTGGG 
ACCCTGACCC 


TGGATCAGTC 
ACCTAGTCAG 


GCTGATTAAA 
CGACTAATTT 


TATGATGAAA 
ATACTACTTT 


3201 


ACGGCAACCC 
TGCCGTTGGG 


GTGGTCGGCT 
CACCAGCCGA 


TACGGCGGTG 
ATGCCGCCAC 


ATTTTGGCGA 
TAAAACCGCT 


TACGCCGAAC 
ATGCGGCTTG 


3 4s x 

f% 3 
1 ==? 


GATCGCCAGT 
CTAGCGGTCA 


TCTGTATGAA 
AGACATACTT 


CGGTCTGGTC 
GCCAGACCAG 


TTTGCCGACC 
AAACGGCTGG 


GCACGCCGCA 
CGTGCGGCGT 


3ioa 

£ K 


TCCAGCGCTG 
AGGTCGCGAC 


ACGGAAGCAA 
TGCCTTCGTT 


AACACCAGCA 
TTGTGGTCGT 


GCAGTTTTTC 
CGTCAAAAAG 


CAGTTCCGTT 
GTCAAGGCAA 


3JI1 

y s 

3<J#1 


TATCCGGGCA 
ATAGGCCCGT 


AACCATCGAA 
TTGGTAGCTT 


GTGACCAGCG 
CACTGGTCGC 


AATACCTGTT 
TTATGGACAA 


CCGTCATAGC 
GGCAGTATCG 


GATAACGAGC 
CTATTGCTCG 


TCCTGCACTG 
AGGACGTGAC 


GATGGTGGCG 
CTACCACCGC 


CTGGATGGTA 
GACCTACCAT 


AGCCGCTGGC 
TCGGCGACCG 


3451 


AAGCGGTGAA 
TTCGCCACTT 


GTGCCTCTGG 
CACGGAGACC 


ATGTCGCTCC 
TACAGCGAGG 


AC AAGG T AAA 
TGTTCCATTT 


CAGTTGATTG 
GTCAACTAAC 


3501 


AACTGCCTGA 
TTGACGGACT 


ACTACCGCAG 
TGATGGCGTC 


CCGGAGAGCG 
GGCCTCTCGC 


CCGGGCAACT 
GGCCCGTTGA 


CTGGCTCACA 
GACCGAGTGT 


3551 


GTACGCGTAG 
CATGCGCATC 


TGCAACCGAA 
ACGTTGGCTT 


CGCGACCGCA 
GCGCTGGCGT 


TGGTCAGAAG 
ACCAGTCTTC 


CCGGGCACAT 
GGCCCGTGTA 


3601 


CAGCGCCTGG 
GTCGCGGACC 


CAGCAGTGGC 
GTCGTCACCG 


GTCTGGCGGA 
CAGACCGCCT 


AAACCTCAGT' 
TTTGGAGTCA 


GTGACGCTCC 
CACTGCGAGG 


3651 


CCGCCGCGTC 
GGCGGCGCAG 


CCACGCCATC 
GGTGCGGTAG 


CCGCATCTGA 
GGCGTAGACT 


CCACCAGCGA 
GGTGGTCGCT 


AATGGATTTT 
TTACCTAAAA 


3701 


TGCATCGAGC 
ACGTAGCTCG 


TGGGTAATAA 
ACCCATTATT 


GCGTTGGCAA 
CGCAACCGTT 


TTTAACCGCC 
AAATTGGCGG 


AGTCAGGCTT 
TCAGTCCGAA 


3751 


TCTTTCACAG 
AGAAAGTGTC 


ATGTGGATTG 
TACACCTAAC 


GCGATAAAAA 
CGCTATTTTT 


ACAACTGCTG 
TGTTGACGAC 


ACGCCGCTGC 
TGCGGCGACG 



1 » 



3301 


GCGATCAGTT 
CGCTAGTCAA 


CACCCGTGTC 
GTGGGCACAG 


GATAGATCTG 
CTATCTAGAC 


AACAGAAACT 
TTGTCTTTGA 


CATTTCCGAA 
GTAAAGGCTT 


3851 


GAAGACCTAG 
CTTCTGGATC 


TCGACCATCA 
AGCTGGTAGT 


TCATCATCAT 
AGTAGTAGTA 


CACCGGTAAT 
GTGGCCATTA 


AATAGGTAGA 
TTATCCATCT 


3901 


TAAGTGACTG 
ATTCACTGAC 


ATTAGATGCA 
TAATCTACGT 


TTTCGACTAG 
AAAGCTGATC 


ATCCCTCGAC 
TAGGGAGCTG 


CAATTCCGGT 
GTTAAGGCCA 


3951 


TATTTTCCAC 
ATAAAAGGTG 


CATATTGCCG 
GTATAACGGC 


TCTTTTGGCA 
AGAAAACCGT 


ATGTGAGGGC 
TACACTCCCG 


CCGGAAACCT 
GGCCTTTGGA 


4001 


GGCCCTGTCT 
CCGGGACAGA 


TCTTGACGAG 
AGAACTGCTC 


CATTCCTAGG 
GTAAGGATCC 


GGTCTTTCCC 
CCAGAAAGGG 


CTCTCGCCAA 
GAGAGCGGTT 


4051 


AGGAATGCAA 
TCCTTACGTT 


GGTCTGTTGA 
CCAGACAACT 


ATGTCGTGAA 
TACAGCACTT 


GGAAGCAGTT 
CCTTCGTCAA 


CCTCTGGAAG 
GGAGACCTTC 


4^31 


CTTCTTGAAG 
GAAGAACTTC 


ACAAACAACG 
TGTTTGTTGC 


TCTGTAGCGA 
AGACATCGCT 


CCCTTTGCAG 
GGGAAACGTC 


GCAGCGGAAC 
CGTCGCCTTG 


4151 


CCCCCACCTG 
GGGGGTGGAC 


GCGACAGGTG 
CGCTGTCCAC 


CCTCTGCGGC 
GGAGACGCCG 


CAAAAGCCAC 
GTTTTCGGTG 


GTGTATAAGA 
CACATATTCT 


4g§l 

n i 


TACACCTGCA 
ATGTGGACGT 


AAGGCGGCAC 
TTCCGCCGTG 


AACCCCAGTG 
TTGGGGTCAC 


CCACGTTGTG 
GGTGCAACAC 


AGTTGGATAG 
TCAACCTATC 


4g§l 


TTGTGGAAAG 
AACACCTTTC 


AGTCAAATGG 
TCAGTTTACC 


CTCTCCTCAA 
GAGAGGAGTT 


GCGTATTCAA 
CGCATAAGTT 


CAAGGGGCTG 
GTTCCCCGAC 


O : 


AAGGATGCCC 
TTCCTACGGG 


AGAAGGTACC 
TCTTCCATGG 


CCATTGTATG 
GGTAACATAC 


GGATCTGATC 
CCTAGACTAG 


TGGGGCCTCG 
ACCCCGGAGC 


45tl 


GTGCACATGC 
CACGTGTACG 


TTTACATGTG 
AAATGTACAC 


TTTAGTCGAG 
AAATCAGCTC 


GTTAAAAAAC 
CAATTTTTTG 


GTCTAGGCCC 
CAGATCCGGG 


4401 


CCCGAACCAC 
GGGCTTGGTG 


GGGGACGTGG 
CCCCTGCACC 


TTTTCCTTTG 
AAAAGGAAAC 


AAAAACACGA 
TTTTTGTGCT 


TGATAATACC 
ACTATTATGG 


4451 


ATGAAAAAGC 
TACTTTTTCG 


CTGAACTCAC 
GACTTGAGTG 


CGCGACGTCT 
GCGCTGCAGA 


GTCGAGAAGT 
CAGCTCTTCA 


TTCTGATCGA 
AAGACTAGCT 


4501 


AAAGTTCGAC 
TTTCAAGCTG 


AGCGTCTCCG 
TCGCAGAGGC 


ACCTGATGCA 
TGGACTACGT 


GCTCTCGGAG 
CGAGAGCCTC 


GGCGAAGAAT 
CCGCTTCTTA 


4551 


CTCGTGCTTT 
GAGCACGAAA 


CAGCTTCGAT 
GTCGAAGCTA 


GTAGGAGGGC 
CATCCTCCCG 


GTGGATATGT 
CACCTATACA 


CCTGCGGGTA 
GGACGCCCAT 


4601 


AATAGCTGCG 
TTATCGACGC 


CCGATGGTTT 
GGCTACCAAA 


CTACAAAGAT 
GATGTTTCTA 


CGTTATGTTT 
GCAATACAAA 


ATCGGCACTT 
TAGCCGTGAA 


4651 


TGCATCGGCC 
ACGTAGCCGG 


GCGCTCCCGA 
CGCGAGGGCT 


TTCCGGAAGT 
AAGGCCTTCA 


GCTTGACATT 
CGAACTGTAA 


GGGGAATTTA 
CCCCTTAAAT 


4701 


GCGAGAGCCT 
CGCTCTCGGA 


GACCTATTGC 
CTGGATAACG 


ATCTCCCGCC 
TAGAGGGCGG 


GTGCACAGGG 
CACGTGTCCC 


TGTCACGTTG 
ACAGTGCAAC 



4751 CAAGACCTGC CTGAAACCGA ACTGCCCGCT GTTCTGCAGC CGGTCGCGGA 
GTTCTGGACG GACTTTGGCT TGACGGGCGA CAAGACGTCG GCCAGCGCCT 



4801 


GGCCATGGAT 
CCGGTACCTA 


GCGATCGCTG 
CGCTAGCGAC 


CGGCCGATCT 
GCCGGCTAGA 


TAGCCAGACG 
ATCGGTCTGC 


AGCGGGTTCG 
TCGCCCAAGC 


4851 


GCCCATTCGG 
CGGGTAAGCC 


ACCGCAAGGA 
TGGCGTTCCT 


ATCGGTCAAT 
TAGCCAGTTA 


ACACTACATG 
TGTGATGTAC 


GCGTGATTTC 
CGCACTAAAG 


4901 


ATATGCGCGA 
TATACGCGCT 


TTGCTGATCC 
AACGACTAGG 


CCATGTGTAT 
GGTACACATA 


CACTGGCAAA 
GTGACCGTTT 


CTGTGATGGA 
GACACTACCT 


4951 . 


. CGACACCGTC 
GCTGTGGCAG 


AGTGCGTCCG 
TCACGCAGGC 


TCGCGCAGGC 
AGCGCGTCCG 


TCTCGATGAG 
AGAGCTACTC 


CTGATGCTTT 
GACTACGAAA 


5001 


GGGCCGAGGA 
CCCGGCTCCT 


CTGCCCCGAA 
GACGGGGCTT 


GTCCGGCACC 
CAGGCCGTGG 


TCGTGCACGC 
AGCACGTGCG 


GGATTTCGGC 
CCTAAAGCCG 




TCCAACAATG 
AGGTTGTTAC 


TCCTGACGGA 
AGGACTGCCT 


CAATGGCCGC 
GTTACCGGCG 


ATAACAGCGG 
TATTGTCGCC 


TCATTGACTG 
AGTAACTGAC 


GAGCGAGGCG 
CTCGCTCCGC 


ATGTTCGGGG 
TACAAGCCCC 


ATTCCCAATA 
TAAGGGTTAT 


CGAGGTCGCC 
GCTCCAGCGG 


AACATCTTCT 
TTGTAGAAGA 


5iil 


TCTGGAGGCC 
AGACCTCCGG 


GTGGTTGGCT 
CACCAACCGA 


TGTATGGAGC 
ACATACCTCG 


AGCAGACGCG 
TCGTCTGCGC 


CTACTTCGAG 
GATGAAGCTC . 


5zte 


CGGAGGCATC 
GCCTCCGTAG 


CGGAGCTTGC 
GCCTCGAACG 


AGGATCGCCG 
TCCTAGCGGC 


CGGCTCCGGG 
GCCGAGGCCC 


CGTATATGCT • 
GCATATACGA 


52gl 


CCGCATTGGT 
GGCGTAACCA 


CTTGACCAAC 
GAACTGSTTG 


TCTATCAGAG 
AGATAGTCTC 


CTTGGTTGAC 
GAACCAACTG 


GGCAATTTCG 
CCGTTAAAGC 


E 

5301 


ATGATGCAGC 
TACTACGTCG 


TTGGGCGCAG 
AACCCGCGTC 


GGTCGATGCG 
CCAGCTACGC 


ACGCAATCGT 
TGCGTTAGCA 


CCGATCCGGA 
GGCTAGGCCT 


5351 


GCCGGGACTG 
CGGCCCTGAC 


TCGGGCGTAC 
AGCCCGCATG 


ACAAATCGCC 
TGTTTAGCGG 


CGCAGAAGCG 
GCGTCTTCGC 


CGGCCGTCTG 
GCCGGCAGAC 


5401 


GACCGATGGC 
CTGGCTACCG 


TGTGTAGAAG 
ACACATCTTC 


TACTCGCCGA 
ATGAGCGGCT 


TAGTGGAAAC 
ATCACCTTTG 


CGACGCCCCA 
GCTGCGGGGT 


5451 


GCACTCGTCC 
CGTGAGCAGG 


GAGGGCAAAG 
CTCCCGTTTC 


GAATAGAGTA 
CTTATCTCAT 


GATGCCGACC 
CTACGGCTGG 


GGGATCTATC 
CCCTAGATAG 


5501 


GATAAAATAA 
CTATTTTATT 


AAGATTTTAT 
TTCTAAAATA 


TTAGTGTCCA 
AATCAGAGGT 


GAAAAAGGGG 
CTTTTTCCCC 


GGAATGAAAG 
CCTTACTTTC 


5551 


ACCCCACCTG 
TGGGGTGGAC 


TAGGTTTGGC 
ATCCAAACCG 


AAGCTAGCTT 
TTCGATCGAA 


AAGTAACGCC 
TTCATTGCGG 


ATTTTGCAAG 
TAAAACGTTC 


5601 


GCATGGAAAA 
CGTACCTTTT 


ATACATAACT 
TATGTATTGA 


GAGAATAGAG 
CTCTTATCTC 


AAGTTCAGAT 
TTCAAGTCTA 


CAAGGTCAGG 
GTTCCAGTCC 


5651 


AACAGATGGA 
TTGTCTACCT 


ACAGCTGAAT 
TGTCGACTTA 


ATGGGCCAAA 
TACCCGGTTT 


CAGGATATCT 
GTCCTATAGA 


g.tggtaagca 
Caccattcgt 



57 01 GTTCCTGCCC CGGCTCAGGG CCAAGAACAG ATGGAACAGC TGAATATGGG 
CAAGGACGGG GCCGAGTCCC GGTTCTTGTC TACCTTGTCG ACTTATACCC 



57 51 CCAAACAGGA TATCTGTGGT AAGCAGTTCC TGCCCCGGCT CAGGGCCAAG 
GGTTTGTCCT ATAGACACCA TTCGTCAAGG ACGGGGCCGA GTCCCGGTTC 



5801 AACAGATGGT CCCCAGATGC GGTCCAGCCC TCAGCAGTTT CTAGAGAACC 
TTGTCTACCA GGGGTCTACG CCAGGTCGGG AGTCGTCAAA GATCTCTTGG 



5851 ATCAGATGTT TCCAGGGTGC CCCAAGGACC TGAAATGACC CTGTGCCTTA 
TAGTCTACAA AGGTCCCACG GGGTTCCTGG ACTTTACTGG GACACGGAAT 



5901 TTTGAACTAA CCAATCAGTT CGCTTCTCGC TTCTGTTCGC GCGCTTCTGC 
AAACTTGATT GGTTAGTCAA GCGAAGAGCG AAGACAAGCG CGCGAAGACG 



5951 TCCCCGAGCT CAATAAAAGA GCCCACAACC CCTCACTCGG GGCGCCAGTC 
AGGGGCTCGA GTTATTTTCT CGGGTGTTGG GGAGTGAGCC CCGCGGTCAG 



6 (Ml CTCCGATTGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAACCCTCTT 
'""H GAGGCTAACT GACTCAGCGG GCCCATGGGC ACATAGGTTA TTTGGGAGAA 
lf2 . 

6(£|l GCAGTTGCAT CCGACTTGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCT 
CGTCAACGTA GGCTGAACAC CAGAGCGACA AGGAACCCTC CCAGAGGAGA 

ij=3 _ — — — — — — — _ — — — — — _ — — — — — — . — — — — — — — - 

a~; jj 

6£G1 GAGTGATTGA CTACCCGTCA GCGGGGGTCT TTCATTCATG CAGCATGTAT 

*** CTCACTAACT GATGGGCAGT CGCCCCCAGA AAGTAAGTAC G T CG T AC AT A 



6r5l CAAAATTAAT TTGGTTTTTT TTCTTAAGTA TTTACATTAA ATGGCCATAG 
r " GTTTTAATTA AACCAAAAAA AAGAATTCAT AAATGTAATT TACCGGTATC 



6 ! g0l TTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGGCGGTT TGCGTATTGG 
O AACGTAATTA CTTAGCCGGT TGCGCGCCCC TCTCCGCCAA ACGCATAACC 

— — — — — — — — — — — — — — — — — — — — — — — — — — - - — — — — — — — — — " 

62 51 CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT 
GCGAGAAGGC GAAGGAGCGA GTGACTGAGC GACGCGAGCC AGCAAGCCGA 



6301 GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 
CGCCGCTCGC CATAGTCGAG TGAGTTTCCG CCATTATGCC AATAGGTGTC 



6351 AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG 
TTAGTCCCCT ATTGCGTCCT TTCTTGTACA CTCGTTTTCC GGTCGTTTTC 



64 01 GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG 
CGGTCCTTGG CATTTTTCCG GCGCAACGAC CGCAAAAAGG TATCCGAGGC 



451 


CCCCCCTGAC 
GGGGGGACTG 


GAGCATCACA 
CTCGTAGTGT 


AAAATCGACG 
TTTTAGCTGC 


CTCAAGTCAG 
GAGTTCAGTC 


AGGTGGCGAA . . 
TCCACCGCTT 


501 


ACCCGACAGG 
TGGGCTGTCC 


ACTATAAAGA 
TGATATTTCT 


TACCAGGCGT 
ATGGTCCGCA 


TTCCCCCTGG 
AAGGGGGACC 


AAGCTCCCTC 
TTCGAGGGAG 


551 


GTGCGCTCTC 
CACGCGAGAG 


CTGTTCCGAC 
GACAAGGCTG 


CCTGCCGCTT 
GGACGGCGAA 


ACCGGATACC 
TGGCCTATGG 


TGTCCGCCTT 
ACAGGCGGAA 



6601 TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC 
AGAGGGAAGC CCTTCGCACC GCGAAAGAGT ATCGAGTGCG AC AT CC AT AG 




6651 TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 
AGTCAAGCCA CATCCAGCAA GCGAGGTTCG ACCCGACACA CGTGCTTGGG 



6701 


CCCGTTCAGC 
GGGCAAGTCG 


CCGACCGCTG 
GGCTGGCGAC 


CGCCTTATCC 
GCGGAATAGG 


GGTAACTATC 
CCATTGATAG 


GTCTTGAGTC 
CAGAACTCAG 


6751 


CAACCCGGTA 
GTTGGGCCAT 


AGACACGACT 
TCTGTGCTGA 


TATCGCCACT 
ATAGCGGTGA 


GGCAGCAGCC 
CCGTCGTCGG 


ACTGGTAACA 
TGACCATTGT 


6801 


GGATTAGCAG 
CCTAATCGTC 


AGCGAGGTAT 
TCGCTCCATA 


GTAGGCGGTG 
CATCCGCCAC 


CTACAGAGTT 
GATGTCTCAA 


CTTGAAGTGG 
GAACTTCACC 


6851 


TGGCCTAACT 
ACCGGATTGA 


ACGGCTACAC 
TGCCGATGTG 


TAGAAGAACA 
ATCTTCTTGT 


GTATTTGGTA 
CATAAACCAT 


TCTGCGCTCT 
AGACGCGAGA 


6901 

CI 


GCTGAAGCCA 
CGACTTCGGT 


GTTACCTTCG 
CAATGGAAGC 


GAAAAAGAGT 
CTTTTTCTCA 


TGGTAGCTCT 
ACCATCGAGA 


TGATCCGGCA 
ACTAGGCCGT 


6951=0 

\ iTS 

* t — 


AACAAACCAC 
TTGTTTGGTG 


CGCTGGTAGC 
GCGACCATCG 


GGTGGTTTTT 
CCACCAAAAA 


TTGTTTGCAA 
AACAAACGTT 


GCAGCAGATT 
CGTCGTCTAA 


700iD 


ACGCGCAGAA 
TGCGCGTCTT 


AAAAAGGATC 
TTTTTCCTAG 


TCAAGAAGAT 
AGTTCTTCTA 


CCTTTGATCT 
GGAAACTAGA 


TTTCTACGGG 
AAAGATGCCC 


705EJ 


GTCTGACGCT 
CAGACTGCGA 


CAGTGGAACG 
GTCACCTTGC 


AAAACTCACG 
TTTTGAGTGC 


TTAAGGGATT 
AATTCCCTAA 


TTGGTCATGA 
AACCAGTACT 


710C 


GATTATCAAA 
CTAATAGTTT 


AAGGATCTTC 
TTCCTAGAAG 


ACCTAGATCC 
TGGATCTAGG 


TTTTAAATTA 
AAAATTTAAT 


AAAATGAAGT 
TTTTACTTCA 


Fc"s 

r » 

715:31" 

L.,-5 


TTGCGGCCGC 
AACGCCGGCG 


AAATCAATCT 
TTTAGTTAGA 


AAAGTATATA 
TTTCATATAT 


TGAGTAAACT 
ACTCATTTGA 


TGGTCTGACA 
ACCAGACTGT 


7201 


GTTACCAATG 
CAATGGTTAC 


CTTAATCAGT 
GAATTAGTCA 


GAGGCACCTA 
CTCCGTGGAT 


TCTCAGCGAT 
AGAGTCGCTA 


CTGTCTATTT 
GACAGATAAA 


7251 


CGTTCATCCA 
GCAAGTAGGT 


TAGTTGCCTG 
ATCAACGGAC 


ACTCCCCGTC 
TGAGGGGCAG 


GTGTAGATAA 
CACATCTATT 


CTACGATACG 
GATGCTATGC 


7301 


GGAGGGCTTA 
CCTCCCGAAT 


CCATCTGGCC 
GGTAGACCGG 


CCAGTGCTGC 
GGTCACGACG 


AATGATACCG 
TTACTATGGC 


CGAGAGCCAC 
GCTCTGGGTG 


7351 


GCTCACCGGC 
CGAGTGGCCG 


TCCAGATTTA 
AGGTCTAAAT 


TCAGCAATAA 
AGTCGTTATT 


ACCAGCCAGC 
TGGTCGGTCG 


CGGAAGGGCC 
GCCTTCCCGG 


7401 


GAGCGCAGAA 
CTCGCGTCTT 


GTGGTCCTGC 
CACCAGGACG 


AACTTTATCC 
TTGAAATAGG 


GCCTCCATCC 
CGGAGGTAGG 


AGTCTATTAA 
TCAGATAATT 


7451 


TTGTTGCCGG 
AACAACGGCC 


GAAGCTAGAG 
CTTCGATCTC 


TAAGTAGTTC 
ATTCATCAAG 


GCCAGTTAAT 
CGGTCAATTA 


AGTTTGCGCA 
TCAAACGCGT 


7501 


ACGTTGTTGC 
TGCAACAACG 


CATTGCTACA 
GTAACGATGT 


GGCATCGTGG 
CCGTAGCACC 


TGTCACGCTC 
ACAGTGCGAG 


GTCGTTTGGT 
CAGCAAACCA 



7 551 ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG TTACATGATC 
TACCGAAGTA AGTCGAGGCC AAGGGTTGCT AGTTCCGCTC AATGTACTAG 



t 

( 



I 



7 601 


CCCCATGTTG 
GGGGTACAAC 


TGCAAAAAAG 
ACGTTTTTTC 


CGGTTAGCTC 
GCCAATCGAG 


CTTCGGTCCT 
GAAGCCAGGA 


CCGATCGTTG 
GGCTAGCAAC 


7651 


TCAGAAGTAA 
AGTCTTCATT 


GTTGGCCGCA 
CAACCGGCGT 


GTGTTATCAC 
CACAATAGTG 


TCATGGTTAT 
AGTACCAATA 


GGCAGCACTG 
CCGTCGTGAC 


7701 


CATAATTCTC 
GTATTAAGAG 


TTACTGTCAT 
AATGACAGTA 


GCCATCCGTA 
CGGTAGGCAT 


AGATGCTTTT 
TCTACGAAAA 


CTGTGACTGG 
GACACTGACC 


7751 


TGAGTACTCA 
ACTCATGAGT 


ACCAAGTCAT 
TGGTTCAGTA 


TCTGAGAATA 
AGACTCTTAT 


GTGTATGCGG 
CACATACGCC 


CGACCGAGTT 
GCTGGCTCAA 


7801 


GCTCTTGCCC 
CGAGAACGGG 


GGCGTCAATA 
CCGCAGTTAT 


CGGGATAATA 
GCCCTATTAT 


CCGCGCCACA 
GGCGCGGTGT 


TAGCAGAACT 
ATCGTCTTGA 


7851 


TTAAAAGTGC 
AATTTTCACG 


TCATCATTGG 
AGTAGTAACC 


AAAACGTTCT 
TTTTGCAAGA 


TCGGGGCGAA 
AGCCCCGCTT 


AACTCTCAAG 
TTGAGAGTTC 


7 9Ql 

%J 


GATCTTACCG 
CTAGAATGGC 


CTGTTGAGAT 
GACAACTCTA 


CCAGTTCGAT 
GGTCAAGCTA 


GTAACCCACT 
CATTGGGTGA 


CGTGCACCCA 
GCACGTGGGT 


- rfr - 
7 9gJ 


ACTGATCTTC 
TGACTAGAAG 


AGCATCTTTT 
TCGTAGAAAA 


ACTTTCACCA 
TGAAAGTGGT 


GCGTTTCTGG 
CGCAAAGACC 


GTGAGCAAAA 
CACTCGTTTT 


— - 

sofii 

2 


ACAGGAAGGC 
TGTCCTTCCG 


AAAATGCCGC 
TTTTACGGCG 


AAAAAAGGGA 
TTTTTTCCCT 


ATAAGGGCGA 
TATTCCCGCT 


CACGGAAATG 
GTGCCTTTAC 


8 obi 

p 


TTGAATACTC 
AACTTATGAG 


ATACTCTTCC 
TATGAGAAGG 


TTTTTCAATA 
AAAAAGTTAT 


TTATTGAAGC 
AATAACTTCG 


ATTTATCAGG 
TAAATAGTCC 


81®1 
8151 


GTTATTGTCT 
CAATAACAGA 


CATGAGCGGA 
GTACTCGCCT 


TACATATTTG 
ATGTATAAAC 


AATGTATTTA 
TTACATAAAT 


GAAAAATAAA 
CTTTTTATTT 


CAAATAGGGG 
GTTTATCCCC 


TTCCGCGCAC 
AAGGCGCGTG 


ATTTC 
TAAAG 
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Figure 13A 
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1 


CTGCAGCCTG 
GACGTCGGAC 


AATATGGGCC 
TTATACCCGG 


AAACAGGATA 
TTTGTCCTAT 


TCTGTGGTAA 
AGACACCATT 


GCAGTTCCTG 
CGTCAAGGAC 


51 


CCCCGGCTCA 
GGGGCCGAGT 


GGGCCAAGAA 
CCCGGTTCTT 


CAGATGGAAC 
GTCTACCTTG 


AGCTGAATAT 
TCGACTTATA 


GGGCCAAACA 
CCCGGTTTGT 


101 


GGATATCTGT 
CCTATAGACA 


GGTAAGCAGT 
CCATTCGTCA 


TCCTGCCCCG 
AGGACGGGGC 


GCTCAGGGCC 
CGAGTCCCGG 


AAGAACAGAT 
TTCTTGTCTA 


151 


GGTCCCCAGA 
CCAGGGGTCT 


TGCGGTCCAG 
ACGCCAGGTC 


CCCTCAGCAG 
GGGAGTCGTC 


TTTCTAGAGA 
AAAGATCTCT 


ACCATCAGAT 
TGGTAGTCTA 


201 


GTTTCCAGGG 
CAAAGGTCCC 


TGCCCCAAGG 
ACGGGGTTCC 


ACCTGAAATG 
TGGACTTTAC 


ACCCTGTGCC 
TGGGACACGG 


TTATTTGAAC 
AATAAACTTG 


251 


TAACCAATCA 
ATTGGTTAGT 


GTTCGCTTCT 
CAAGCGAAGA 


CGCTTCTGTT 
GCGAAGACAA 


CGCGCGCTTC 
GCGCGCGAAG 


TGCTCCCCGA 
ACGAGGGGCT 


^ t 


GCTCAATAAA 
CGAGTTATTT 


AGAGCCCACA 
TCTCGGGTGT 


ACCCCTCACT 
TGGGGAGTGA 


CGGGGCGCCA 
GCCCCGCGGT 


GTCCTCCGAT 
CAGGAGGCTA 


~u r 

s ^ 


TGACTGAGTC 
ACTGACTCAG 


GCCCGGGTAC 
CGGGCCCATG 


CCGTGTATCC 
GGCACATAGG 


AATAAACCCT 
TTATTTGGGA 


CTTGCAGTTG 
GAACGTCAAC 


s 


CATCCGACTT 
GTAGGCTGAA 


GTGGTCTCGC 
CACCAGAGCG 


TGTTCCTTGG 
ACAAGGAACC 


GAGGGTCTCC 
CTCCCAGAGG 


TCTGAGTGAT 
AGACTCACTA 


r 3 
451 


TGACTACCCG 
ACTGATGGGC 


TCAGCGGGGG 
AGTCGCCCCC 


TCTTTCATTT 
AGAAAGTAAA 


GGGGGCTCGT 
CCCCCGAGCA 


CCGGGATCGG 
GGCCCTAGCC 




GAGACCCCTG 
CTCTGGGGAC 


CCCAGGGACC 
GGGTCCCTGG 


ACCGACCCAC 
TGGCTGGGTG 


CACCGGGAGG 
GTGGCCCTCC 


CAAGCTGGCC 
GTTCGACCGG 


551 


AGCAACTTAT 
TCGTTGAATA 


CTGTGTCTGT 
GACACAGACA 


CCGATTGTCT 
GGCTAACAGA 


AGTGTCTATG 
TCACAGATAC 


ACTGATTTTA 
TGACTAAAAT 


601 


TGCGCCTGCG 
ACGCGGACGC 


TCGGTACTAG 
AGCCATGATC 


TTAGCTAACT 
AATCGATTGA 


AGCTCTGTAT 
TCGAGACATA 


CTGGCGGACC 
GACCGCCTGG 


651 


CGTGGTGGAA 
GCACCACCTT 


CTGACGAGTT 
GACTGCTCAA 


CTGAACACCC 
GACTTGTGGG 


GGCCGCAACC 
CCGGCGTTGG 


CTGGGAGACG 
GACCCTCTGC 


701 


TCCCAGGGAC 
AGGGTCCCTG 


TTTGGGGGCC 
AAACCCCCGG 


GTTTTTGTGG 
CAAAAACACC 


CCCGACCTGA 
GGGCTGGACT 


GGAAGGGAGT 
CCTTCCCTCA 


751 


CGATGTGGAA 
GCTACACCTT 


TCCGACCCCG 
AGGCTGGGGC 


TCAGGATATG 
AGTCCTATAC 


TGGTTCTGGT 
ACCAAGACCA 


AGGAGACGAG 
TCCTCTGCTC 


801 


AACCTAAAAC 
TTGGATTTTG 


AGTTCCCGCC 
TCAAGGGCGG 


TCCGTCTGAA 
AGGCAGACTT 


TTTTTGCTTT 
AAAAACGAAA 


CGGTTTGGAA 
GCCAAACCTT 


851 


CCGAAGCCGC 
GGCTTCGGCG 


GCGTCTTGTC 
CGCAGAACAG 


TGCTGCAGCA 
ACGACGTCGT 


TCGTTCTGTG 
AGCAAGACAC 


TTGTCTCTGT 
AACAGAGACA 


901 


CTGACTGTGT 
GACTGACACA 


TTCTGTATTT 
AAGACATAAA 


GTCTGAAAAT 
CAGACTTTTA 


TAGGGCCAGA 
ATCCCGGTCT 


CTGTTACCAC 
GACAATGGTG 



FIGURE 13B 




951 TCCCTTAAGT TTGACCTTAG GTAACTGGAA AGATGTCGAG CGGCTCGCTC 
AGGGAATTCA AACTGGAATC CATTGACCTT TCTACAGCTC GCCGAGCGAG 



001 


ACAACCAGTC 
TGTTGGTCAG 


GGTAGATGTC 
CCATCTACAG 


AAGAAGAGAC 
TTCTTCTCTG 


GTTGGGTTAC 
CAACCCAATG 


CTTCTGCTCT 
GAAGACGAGA 


051 


GCAGAATGGC 
CGTCTTACCG 


CAACCTTTAA 
GTTGGAAATT 


CGTCGGATGG 
GCAGCCTACC 


CCGCGAGACG 
GGCGCTCTGC 


GCACCTTTAA 
CGTGGAAATT 


101 


CCGAGACCTC 
GGCTCTGGAG 


ATCACCCAGG 
TAGTGGGTCC 


TTAAGATCAA 
AATTCTAGTT 


GGTCTTTTCA 
CCAGAAAAGT 


CCTGGCCCGC 
GGACCGGGCG 


151 


ATGGACACCC 
TACCTGTGGG 


AGACCAGGTC 
TCTGGTCCAG 


CCCTACATCG 
GGGATGTAGC 


TGACCTGGGA 
ACTGGACCCT 


AGCCTTGGCT 
TCGGAACCGA 



1201 TTTGACCCCC CTCCCTGGGT CAAGCCCTTT GTACACCCTA AGCCTCCGCC 
AAACTGGGGG GAGGGACCCA GTTCGGGAAA CATGTGGGAT TCGGAGGCGG 



12 5j§ TCCTCTTCCT CCATCCGCCC CGTCTCTCCC CCTTGAACCT CCTCGTTCGA 
Cs AGGAGAAGGA GGTAGGCGGG GCAGAGAGGG GGAACTTGGA GGAGCAAGCT 



13 OX CCCCGCCTCG ATCCTCCCTT TATCCAGCCC TCACTCCTTC TCTAGGCGCC 
^ GGGGCGGAGC TAGGAGGGAA ATAGGTCGGG AGTGAGGAAG AGATCCGCGG 



13 34? GGCCGCTCTA GCCCATTAAT ACGACTCACT ATAGGGCGAT TCGAACACCA 
RJ CCGGCGAGAT CGGGTAATTA TGCTGAGTGA TATCCCGCTA AGCTTGTGGT 



14<yj TGCACCATCA TCATCATCAC GTCGACGAAC AGAAACTCAT TTCCGAAGAA 
M, ACGTGGTAGT AGTAGTAGTG CAGCTGCTTG TCTTTGAGTA AAGGCTTCTT 



s 

n 


GACCTACTCG 
CTGGATGAGC 


AGATGGGCGT 
TCTACCCGCA 


GATTACGGAT 
CTAATGCCTA 


TCACTGGCCG 
AGTGACCGGC 


TCGTTTTACA 
AGCAAAATGT 




ACGTCGTGAC 
TGCAGCACTG 


TGGGAAAACC 
ACCCTTTTGG 


CTGGCGTTAC 
GACCGCAATG 


CCAACTTAAT 
GGTTGAATTA 


CGCCTTGCAG 
GCGGAACGTC 


551 


CACATCCCCC 
GTGTAGGGGG 


TTTCGCCAGC 
AAAGCGGTCG 


TGGCGTAATA 
ACCGCATTAT 


GCGAAGAGGC 
CGCTTCTCCG 


CCGCACCGAT 
GGCGTGGCTA 


601 


CGCCCTTCCC 
GCGGGAAGGG 


AACAGTTACG 
TTGTCAATGC 


CAGCCTGAAT 
GTCGGACTTA 


GGCGAATGGC 
CCGCTTACCG 


GCTTTGCCTG 
CGAAACGGAC 


651 


GTTTCCGGCA 
CAAAGGCCGT 


CCAGAAGCGG 
GGTCTTCGCC 


TGCCGGAAAG 
ACGGCCTTTC 


CTGGCTGGAG 
GACCGACCTC 


TGCGATCTTC 
ACGCTAGAAG 


701 


CTGAGGCCGA 
GACTCCGGCT 


TACTGTCGTC 
ATGACAGCAG 


GTCCCCTCAA 
CAGGGGAGTT 


ACTGGCAGAT 
TGACCGTCTA 


GCACGGTTAC 
CGTGCCAATG 


751 


GATGCGCCCA 
CTACGCGGGT 


TCTACACCAA 
AGATGTGGTT 


CGTGACCTAT 
GCACTGGATA 


CCCATTACGG 
GGGTAATGCC 


TCAATCCGCC 
AGTTAGGCGG 



1801 GTTTGTTCCC ACGGAGAATC CGACGGGTTG TTACTCGCTC ACATTTAATG 
CAAACAAGGG TGCCTCTTAG GCTGCCCAAC AATGAGCGAG TGTAAATTAC 



1851 TTGATGAAAG CTGGCTACAG GAAGGCCAGA CGCGAATTAT TTTTGATGGC 
AACTACTTTC GACCGATGTC CTTCCGGTCT GCGCTTAATA AAAACTACCG 




1901 GTTAACTCGG CGTTTCATCT GTGGTGCAAC GGGCGCTGGG TCGGTTACGG 
CAATTGAGCC GCAAAGTAGA CACCACGTTG CCCGCGACCC AGCCAATGCC 



951 


CCAGGACAGT 
GGTCCTGTCA 


CGTTTGCCGT 
GCAAACGGCA 


CTGAATTTGA 
GACTTAAACT 


CCTGAGCGCA 
GGACTCGCGT 


TTTTTACGCG 
AAAAATGCGC 


001 


CCGGAGAAAA 
GGCCTCTTTT 


CCGCCTCGCG 
GGCGGAGCGC 


GTGATGGTGC 
CACTACCACG 


TGCGCTGGAG 
ACGCGACCTC 


TGACGGCAGT 
ACTGCCGTCA 


051 


TATCTGGAAG 
ATAGACCTTC 


ATCAGGATAT 
TAGTCCTATA 


GTGGCGGATG 
CACCGCCTAC 


AGCGGCATTT 
TCGCCGTAAA 


TCCGTGACGT 
AGGCACTGCA 


101 


CTCGTTGCTG 
GAGCAACGAC 


CATAAACCGA 
GTATTTGGCT 


CTACACAAAT 
GATGTGTTTA 


CAGCGATTTC 
GTCGCTAAAG 


CATGTTGCCA 
GTACAACGGT 



2151 CTCGCTTTAA TGATGATTTC AGCCGCGCTG TACTGGAGGC TGAAGTTCAG 
GAGCGAAATT ACTACTAAAG TCGGCGCGAC ATGACCTCCG ACTTCAAGTC 



2Zgf ATGTGCGGCG AGTTGCGTGA CTACCTACGG GTAACAGTTT CTTTATGGCA 

T\ TACACGCCGC TCAACGCACT GATGGATGCC CATTGTCAAA GAAATACCGT 

^4 



22'5l GGGTGAAACG CAGGTCGCCA GCGGCACCGC GCCTTTCGGC GGTGAAATTA 
"f* CCCACTTTGC GTCCAGCGGT CGCCGTGGCG CGGAAAGCCG CCACTTTAAT 



23§1 TCGATGAGCG TGGTGGTTAT GCCGATCGCG TCACACTACG TCTGAACGTC 
!U AGCTACTCGC ACCACCAATA CGGCTAGCGC AGTGTGATGC AGACTTGCAG 



23SL GAAAACCCGA AACTGTGGAG CGCCGAAATC CCGAATCTCT ATCGTGCGGT 
CTTTTGGGCT TTGACACCTC GCGGCTTTAG GGCTTAGAGA TAGCACGCCA 



2 4§% GGTTGAACTG CACACCGCCG ACGGCACGCT GATTGAAGCA GAAGCCTGCG 
CCAACTTGAC GTGTGGCGGC TGCCGTGCGA CTAACTTCGT CTTCGGACGC 



451 


ATGTCGGTTT 
TACAGCCAAA 


CCGCGAGGTG 
GGCGCTCCAC 


CGGATTGAAA 
GCCTAACTTT 


ATGGTCTGCT 
TACCAGACGA 


GCTGCTGAAC 
CGACGACTTG 


501 


GGCAAGCCGT 
CCGTTCGGCA 


TGCTGATTCG 
ACGACTAAGC 


AGGCGTTAAC 
TCCGCAATTG 


CGTCACGAGC 
GCAGTGCTCG 


ATCATCCTCT 
TAGTAGGAGA 


551 


GCATGGTCAG 
CGTACCAGTC 


GTCATGGATG 
CAGTACCTAC 


AGCAGACGAT 
TCGTCTGCTA 


GGTGCAGGAT 
CCACGTCCTA 


ATCCTGCTGA 
TAGGACGACT 


601 


TGAAGCAGAA 
ACTTCGTCTT 


CAACTTTAAC 
GTTGAAATTG 


GCCGTGCGCT 
CGGCACGCGA 


GTTCGCATTA 
CAAGCGTAAT 


TCCGAACCAT 
AGGCTTGGTA 


651 


CCGCTGTGGT 
GGCGACACCA 


ACACGCTGTG 
TGTGCGACAC 


CGACCGCTAC 
GCTGGCGATG 


GGCCTGTATG 
CCGGACATAC 


TGGTGGATGA 
ACCACCTACT 


701 


AGCCAATATT 
TCGGTTATAA 


GAAACCCACG 
CTTTGGGTGC 


GCATGGTGCC 
CGTACCACGG 


AATGAATCGT 
TTACTTAGCA 


CTGACCGATG 
GACTGGCTAC 



27 51 ATCCGCGCTG GCTACCGGCG ATGAGCGAAC GCGTAACGCG AATGGTGCAG 
TAGGCGCGAC CGATGGCCGC TACTCGCTTG CGCATTGCGC TTACCACGTC 



28 01 CGCGATCGTA ATCACCCGAG TGTGATCATC TGGTCGCTGG GGAATGAATC 
GCGCTAGCAT TAGTGGGCTC ACACTAGTAG ACCAGCGACC CCTTACTTAG 



t I 



2351 


AGGCCACGGC 
TCCGGTGCCG 


GCTAATCACG 
CGATTAGTGC 


ACGCGCTGTA 
TGCGCGACAT 


TCGCTGGATC 
AGCGACCTAG 


AAATCTGTCG 
TTTAGACAGC 


2901 


ATCCTTCCCG 
TAGGAAGGGC 


CCCGGTGCAG 
GGGCCACGTC 


TATGAAGGCG 
ATACTTCCGC 


GCGGAGCCGA 
CGCCTCGGCT 


CACCACGGCC 
GTGGTGCCGG 


2951 


ACCGATATTA 
TGGCTATAAT 


TTTGCCCGAT 
AAACGGGCTA 


GTACGCGCGC 
CATGCGCGCG 


GTGGATGAAG 
CACCTACTTC 


ACCAGCCCTT 
TGGTCGGGAA 


3001 


CCCGGCTGTG 
GGGCCGACAC 


CCGAAATGGT 
GGCTTTACCA 


CCATCAAAAA 
GGTAGTTTTT 


ATGGCTTTCG 
TACCGAAAGC 


CTACCTGGAG 
GATGGACCTC 


3051 


AGACGCGCCC 
TCTGCGCGGG 


GCTGATCCTT 
CGACTAGGAA 


TGCGAATACG 
ACGCTTATGC 


CCCACGCGAT 
GGGTGCGCTA 


GGGTAACAGT 
CCCATTGTCA 


3101 


CTTGGCGGTT 
GAACCGCCAA 


TCGCTAAATA 
AGCGATTTAT 


CTGGCAGGCG 
GACCGTCCGC 


TTTCGTCAGT 
AAAGCAGTCA 


ATCCCCGTTT 
TAGGGGCAAA 


Nl 


ACAGGGCGGC 
TGTCCCGCCG 


TTCGTCTGGG 
AAGCAGACCC 


ACTGGGTGGA 
TGACCCACCT 


TCAGTCGCTG 
AGTCAGCGAC 


ATTAAATATG 
TAATTTATAC 


32% 

5 i s 


ATGAAAACGG 
TACTTTTGCC 


CAACCCGTGG 
GTTGGGCACC 


TCGGCTTACG 
AGCCGAATGC 


GCGGTGATTT 
CGCCACTAAA 


TGGCGATACG 
ACCGCTATGC 


: Est 

32^1 


CCGAACGATC 
GGCTTGCTAG 


GCCAGTTCTG 
CGGTCAAGAC 


TATGAACGGT 
ATACTTGCCA 


CTGGTCTTTG 
GACCAGAAAC 


CCGACCGCAC 
GGCTGGCGTG 


33ga 


GCCGCATCCA 
CGGCGTAGGT 


GCGCTGACGG 
CGCGACTGCC 


AAGCAAAACA 
TTCGTTTTGT 


CCAGCAGCAG 
GGTCGTCGTC 


TTTTTCCAGT 
AAAAAGGTCA 


33#1 


TCCGTTTATC 
AGGCAAATAG 


CGGGCAAACC 
GCCCGTTTGG 


ATCGAAGTGA 
TAGCTTCACT 


CCAGCGAATA 
GGTCGCTTAT 


CCTGTTCCGT 
GGACAAGGCA 




CATAGCGATA 
GTATCGCTAT 


ACGAGCTCCT 
TGCTCGAGGA 


GCACTGGATG 
CGTGACCTAC 


GTGGCGCTGG 
CACCGCGACC 


ATGGTAAGCC 
TACCATTCGG 


3451 


GCTGGCAAGC 
CGACCGTTCG 


GGTGAAGTGC 
CCACTTCACG 


CTCTGGATGT 
GAGACCTACA 


CGCTCCACAA 
GCGAGGTGTT 


GGTAAACAGT 
CCATTTGTCA 


3501 


TGATTGAACT 
ACTAACTTGA 


GCCTGAACTA 
CGGACTTGAT 


CCGCAGCCGG 
GGCGTCGGCC 


AGAGCGCCGG 
TCTCGCGGCC 


GCAACTCTGG 
CGTTGAGACC 


3551 


CTCACAGTAC 
GAGTGTCATG 


GCGTAGTGCA 
CGCATCACGT 


ACCGAACGCG 
TGGCTTGCGC 


ACCGCATGGT 
TGGCGTACCA 


CAGAAGCCGG 
GTCTTCGGCC 


3601 


GCACATCAGC 
CGTGTAGTCG 


GCCTGGCAGC 
CGGACCGTCG 


AGTGGCGTCT 
TCACCGCAGA 


GGCGGAAAAC 
CCGCCTTTTG 


CTCAGTGTGA 
GAGTCACACT 


3651 


CGCTCCCCGC 
GCGAGGGGCG 


CGCGTCCCAC 
GCGCAGGGTG 


GCCATCCCGC 
CGGTAGGGCG 


ATCTGACCAC 
TAGACTGGTG 


CAGCGAAATG 
GTCGCTTTAC 


3701 


GATTTTTGCA 
CTAAAAACGT 


TCGAGCTGGG 
AGCTCGACCC 


TAATAAGCGT 
ATTATTCGCA 


TGGCAATTTA 
ACCGTTAAAT 


ACCGCCAGTC 
TGGCGGTCAG 


3751 


AGGCTTTCTT 
TCCGAAAGAA 


TCACAGATGT 
AGTGTCTACA 


GGATTGGCGA 
CCTAACCGCT 


TAAAAAACAA 
ATTTTTTGTT 


CTGCTGACGC 
GACGACTGCG 



3 801 CGCTGCGCGA TCAGTTCACC CGTGTCGATA GATCTGGAGG TGGTGGCAGC 
GCGACGCGCT AGTCAAGTGG GCACAGCTAT CTAGACCTCC ACCACCGTCG 



3 351 AGGCCTTGGC GCGCCGGATC CTTAATTAAC AATTGACCGG TAATAATAGG 
TCCGGAACCG CGCGGCCTAG GAATTAATTG TTAACTGGCC ATTATTATCC 



901 


TAGATAAGTG 
ATCTATTCAC 


ACTGATTAGA 
TGACTAATCT 


TGCATTTCGA 
ACGTAAAGCT 


CTAGATCCCT 
GATCTAGGGA 


CGACCAATTC 
GCTGGTTAAG 


951 


CGGTTATTTT 
GCCAATAAAA 


CCACCATATT 
GGTGGTATAA 


GCCGTCTTTT 
CGGCAGAAAA 


GGCAATGTGA 
CCGTTACACT 


GGGCCCGGAA 
CCCGGGCCTT 


001 


ACCTGGCCCT 
TGGACCGGGA 


GTCTTCTTGA 
CAGAAGAACT 


CGAGCATTCC 
GCTCGTAAGG 


TAGGGGTCTT 
ATCCCCAGAA 


TCCCCTCTCG 
AGGGGAGAGC 


051 


CCAAAGGAAT 
GGTTTCCTTA 


GCAAGGTCTG 
CGTTCCAGAC 


TTGAATGTCG 
AACTTACAGC 


TGAAGGAAGC 
ACTTCCTTCG 


AGTTCCTCTG 
TCAAGGAGAC 



41G1 GAAGCTTCTT GAAGACAAAC AACGTCTGTA GCGACCCTTT GCAGGCAGCG 
s CTTCGAAGAA CTTCTGTTTG TTGCAGACAT CGCTGGGAAA CGTCCGTCGC 



41 Si GAACCCCCCA CCTGGCGACA GGTGCCTCTG CGGCCAAAAG CCACGTGTAT 

~r* CTTGGGGGGT GGACCGCTGT CCACGGAGAC GCCGGTTTTC GGTGCACATA 

4 26S AAGATACACC TGCAAAGGCG GCACAACCCC AGTGCCACGT TGTGAGTTGG 

lU TTCTATGTGG ACGTTTCCGC CGTGTTGGGG TCACGGTGCA ACACTCAACC 



4 2iJ ATAGTTGTGG AAAGAGTCAA ATGGCTCTCC TCAAGCGTAT TCAACAAGGG 
U TATCAACACC TTTCTCAGTT TACCGAGAGG AGTTCGCATA AGTTGTTCCC 



3ffl 


GCTGAAGGAT 
CGACTTCCTA 


GCCCAGAAGG 
CGGGTCTTCC 


TACCCCATTG 
ATGGGGTAAC 


TATGGGATCT 
ATACCCTAGA 


GATCTGGGGC 
CTAGACCCCG 


351 


CTCGGTGCAC 
GAGCCACGTG 


ATGCTTTACA 
TACGAAATGT 


TGTGTTTAGT 
ACACAAATCA 


CGAGGTTAAA 
GCTCCAATTT 


AAACGTCTAG 
TTTGCAGATC 


401 


GCCCCCCGAA 
CGGGGGGCTT 


* 

CCACGGGGAC 
GGTGCCCCTG 


GTGGTTTTCC 
CACCAAAAGG 


TTTGAAAAAC 
AAACTTTTTG 


ACGATGATAA 
TGCTACTATT 


451 


TACCATGAAA 
ATGGTACTTT 


AAGCCTGAAC 
TTCGGACTTG 


TCACCGCGAC 
AGTGGCGCTG 


GTCTGTCGAG 
CAGACAGCTC 


AAGTTTCTGA 
TTCAAAGACT 


501 


TCGAAAAGTT 
AGCTTTTCAA 


CGACAGCGTC 
GCTGTCGCAG 


TCCGACCTGA 
AGGCTGGACT 


TGCAGCTCTC 
ACGTCGAGAG 


GGAGGGCGAA 
CCTCCCGCTT 


551 


GAATCTCGTG 
CTTAGAGCAC 


CTTTCAGCTT 
GAAAGTCGAA 


CGATGTAGGA 
GCTACATCCT 


GGGCGTGGAT 
CCCGCACCTA 


ATGTCCTGCG 
TACAGGACGC 


601 


GGTAAATAGC 
CCATTTATCG 


TGCGCCGATG 
ACGCGGCTAC 


GTTTCTACAA 
CAAAGATGTT 


AGATCGTTAT 
TCTAGCAATA 


GTTTATCGGC 
CAAATAGCCG 


651 


ACTTTGCATC 
TGAAACGTAG 


GGCCGCGCTC 
CCGGCGCGAG 


CCGATTCCGG 
GGCTAAGGCC 


AAGTGCTTGA 
TTCACGAACT 


CATTGGGGAA 
GTAACCCCTT 


701 


TTTAGCGAGA 
AAATCGCTCT 


GCCTGACCTA 
CGGACTGGAT 


TTGCATCTCC 
AACGTAGAGG 


CGCCGTGCAC 
GCGGCACGTG 


AGGGTGTCAC 
TCCCACAGTG 




4 751 GTTGCAAGAC CTGCCTGAAA CCGAACTGCC CGCTGTTCTG CAGCCGGTCG 
CAACGTTCTG GACGGACTTT GGCTTGACGG GCGACAAGAC GTCGGCCAGC 



4801 


CGGAGGCCAT 
GCCTCCGGTA 


• 

GGATGCGATC 
CCTACGCTAG 


GCTGCGGCCG 
CGACGCCGGC 


ATCTTAGCCA 
TAGAATCGGT 


GACGAGCGGG 
CTGCTCGCCC 


4851 


TTCGGCCCAT 
AAGCCGGGTA 


TCGGACCGCA 
AGCCTGGCGT 


AGGAATCGGT 
TCCTTAGCCA 


CAATACACTA 
GTTATGTGAT 


CATGGCGTGA 
GTACCGCACT 


4901 


TTTCATATGC 
AAAGTATACG 


GCGATTGCTG 
CGCTAACGAC 


ATCCCCATGT 
TAGGGGTACA 


GTATCACTGG 
CATAGTGACC 


CAAACTGTGA 
GTTTGACACT 


4951 


TGGACGACAC 
ACCTGCTGTG 


CGTCAGTGCG 
GCAGTCACGC 


TCCGTCGCGC 
AGGCAGCGCG 


AGGCTCTCGA 
TCCGAGAGCT 


TGAGCTGATG 
ACTCGACTAC 


5001 


CTTTGGGCCG 
GAAACCCGGC 


AGGACTGCCC 
TCCTGACGGG 


CGAAGTCCGG 
GCTTCAGGCC 


CACCTCGTGC 
GTGGAGCACG 


ACGCGGATTT 
TGCGCCTAAA 


503ffl 


CGGCTCCAAC 
GCCGAGGTTG 


AATGTCCTGA 
TTACAGGACT 


CGGACAATGG 
GCCTGTTACC 


CCGCATAACA 
GGCGTATTGT 


GCGGTCATTG 
CGCCAGTAAC 


5l( ® 


ACTGGAGCGA 
TGACCTCGCT 


GGCGATGTTC 
CCGCTACAAG 


GGGGATTCCC 
CCCCTAAGGG 


AATACGAGGT 
TTATGCTCCA 


CGCCAACATC 
GCGGTTGTAG 


In 

sia 

w 


TTCTTCTGGA 
AAGAAGACCT 


GGCCGTGGTT 
CCGGCACCAA 


GGCTTGTATG 
CCGAACATAC 


GAGCAGCAGA 
CTCGTCGTCT 


CGCGCTACTT 
GCGCGATGAA 


52(0 


CGAGCGGAGG 
GCTCGCCTCC 


CATCCGGAGC 
GTAGGCCTCG 


TTGCAGGATC 
AACGTCCTAG 


GCCGCGGCTC 
CGGCGCCGAG 


CGGGCGTATA 
GCCCGCATAT 


5251 

o 


TGCTCCGCAT 
ACGAGGCGTA 


TGGTCTTGAC 
ACCAGAACTG 


CAACTCTATC 
GTTGAGATAG 


AGAGCTTGGT 
TCTCGAACCA 


TGACGGCAAT 
ACTGCCGTTA 


5301 


TTCGATGATG 
AAGCTACTAC 


CAGCTTGGGC 
GTCGAACCCG 


GCAGGGTCGA 
CGTCCCAGCT 


TGCGACGCAA 
ACGCTGCGTT 


TCGTCCGATC 
AGCAGGCTAG 


5351 


CGGAGCCGGG 
GCCTCGGCCC 


ACTGTCGGGC 
TGACAGCCCG 


GTACACAAAT 
CATGTGTTTA 


CGCCCGCAGA 
GCGGGCGTCT 


AGCGCGGCCG 
TCGCGCCGGC 


5401 


TCTGGACCGA 
AGACCTGGCT 


TGGCTGTGTA 
ACCGACACAT 


GAAGTACTCG 
CTTCATGAGC 


CCGATAGTGG 
GGCTATCACC 


AAACCGACGC 
TTTGGCTGCG 


5451 


CCCAGCACTC 
GGGTCGTGAG 


GTCCGAGGGC 
CAGGCTCCCG 


AAAGGAATAG 
TTTCCTTATC 


AGTAGATGCC 
TCATCTACGG 


GACCGGGATC 
CTGGCCCTAG 


5501 


TATCGATAAA 
ATAGCTATTT 


ATAAAAGATT 
TATTTTCTAA 


TTATTTAGTC 
AATAAATCAG 


T CC AG AAAAA 
AGGTCTTTTT 


GGGGGGAATG 
CCCCCCTTAC 


5551 


AAAGACCCCA 
TTTCTGGGGT 


CCTGTAGGTT 
GGACATCCAA 


TGGCAAGCTA 
ACCGTTCGAT 


GCTTAAGTAA 
CGAATTCATT 


CGCCATTTTG 
GCGGTAAAAC 


5601 


CAAGGCATGG 
GTTCCGTACC 


AAAAATACAT 
TTTTTATGTA 


AACTGAGAAT 
TTGACTCTTA 


AGAGAAGTTC 
TCTCTTCAAG 


AGATCAAGGT 
TCTAGTTCCA 



5651 CAGGAACAGA TGGAACAGCT GAATATGGGC CAAACAGGAT ATCTGTGGTA 
GTCCTTGTCT ACCTTGTCGA CTTATACCCG GTTTGTCCTA TAGACACCAT 



5701 


AGCAGTTCCT 
TCGTCAAGGA 


GCCCCGGCTC 
CGGGGCCGAG 


AGGGCCAAGA 
TCCCGGTTCT 


ACAGATGGAA 
TGTCTACCTT 


CAGCTGAATA 
GTCGACTTAT 


5751 


TGGGCCAAAC 
ACCCGGTTTG 


AGGATATCTG 
TCCTATAGAC 


TGGTAAGCAG 
ACCATTCGTC 


TTCCTGCCCC 
AAGGACGGGG 


GGCTCAGGGC 
CCGAGTCCCG 


5801 


CAAGAACAGA 
GTTCTTGTCT 


TGGTCCCCAG 
ACCAGGGGTC 


ATGCGGTCCA 
TACGCCAGGT 


GCCCTCAGCA 
CGGGAGTCGT 


GTTTCTAGAG 
CAAAGATCTC 


5851 


AACCATCAGA 
TTGGTAGTCT 


TGTTTCCAGG 
ACAAAGGTCC 


GTGCCCCAAG 
CACGGGGTTC 


GACCTGAAAT 
CTGGACTTTA 


GACCCTGTGC 
CTGGGACACG 


5901 


CTTATTTGAA 
GAATAAACTT 


CTAACCAATC 
GATTGGTTAG 


AGTTCGCTTC 
TCAAGCGAAG 


TCGCTTCTGT 
AGCGAAGACA 


TCGCGCGCTT 
AGCGCGCGAA 


5951 


CTGCTCCCCG 
GACGAGGGGC 


AGCTCAATAA 
TCGAGTTATT 


AAGAGCCCAC 
TTCTCGGGTG 


AACCCCTCAC 
TTGGGGAGTG 


TCGGGGCGCC 
AGCCCCGCGG 


6 Op. 

^* ft 


AGTCCTCCGA 
TCAGGAGGCT 


TTGACTGAGT 
AACTGACTCA 


CGCCCGGGTA 
GCGGGCCCAT 


CCCGTGTATC 
GGGCACATAG 


CAATAAACCC 
GTTATTTGGG 


60li 

7 ft. 


TCTTGCAGTT 
AGAACGTCAA 


GCATCCGACT 
CGTAGGCTGA 


TGTGGTCTCG 
ACACCAGAGC 


CTGTTCCTTG 
GACAAGGAAC 


GGAGGGTCTC 
CCTCCCAGAG 


erii 


CTCTGAGTGA 
GAGACTCACT 


TTGACTACCC 
AACTGATGGG 


GTCAGCGGGG 
CAGTCGCCCC 


GTCTTTCATT 
CAGAAAGTAA 


CATGCAGCAT 
GTACGTCGTA 


61151 


GTATCAAAAT 
CATAGTTTTA 


TAATTTGGTT 
ATTAAACCAA 


TTTTTTCTTA 
AAAAAAGAAT 


AGTATTTACA 
TCATAAATGT 


TTAAATGGCC 
AATTTACCGG 


62p. 


ATAGTTGCAT 
TATCAACGTA 


TAATGAATCG 
ATTACTTAGC 


GCCAACGCGC 
CGGTTGCGCG 


GGGGAGAGGC 
CCCCTCTCCG 


GGTTTGCGTA 
CCAAACGCAT 


62*51 


TTGGCGCTCT 
AACCGCGAGA 


TCCGCTTCCT 
AGGCGAAGGA 


CGCTCACTGA 
GCGAGTGACT 


CTCGCTGCGC 
GAGCGACGCG 


TCGGTCGTTC 
AGCCAGCAAG 


6301 


GGCTGCGGCG 
CCGACGCCGC 


AGCGGTATCA 
TCGCCATAGT 


GCTCACTCAA 
CGAGTGAGTT 


AGGCGGTAAT 
TCCGCCATTA 


ACGGTTATCC 
TGCCAATAGG 


6351. 


ACAGAATCAG 
TGTCTTAGTC 


GGGATAACGC 
CCCTATTGCG 


AGGAAAGAAC 
TCCTTTCTTG 


ATGTGAGCAA 
TACACTCGTT 


AAGGCCAGCA 
TTCCGGTCGT 


6401 


AAAGGCCAGG 
TTTCCGGTCC 


AACCGTAAAA 
TTGGCATTTT 


AGGCCGCGTT 
TCCGGCGCAA 


GCTGGCGTTT 
CGACCGCAAA 


TTCCATAGGC 
AAGGTATCCG 


6451 


TCCGCCCCCC 
AGGCGGGGGG 


TGACGAGCAT 
ACTGCTCGTA 


CACAAAAATC 
GTGTTTTTAG 


GACGCTCAAG 
CTGCGAGTTC 


TCAGAGGTGG 
AGTCTCCACC 


6501 


CGAAACCCGA 
GCTTTGGGCT 


CAGGACTATA 
GTCCTGATAT 


AAGATACCAG 
TTCTATGGTC 


GCGTTTCCCC 
CGCAAAGGGG 


CTGGAAGCTC 
GACCTTCGAG 


6551 


CCTCGTGCGC 
GGAGCACGCG 


TCTCCTGTTC 
AGAGGACAAG 


CGACCCTGCC 
GCTGGGACGG 


GCTTACCGGA 
CGAATGGCCT 


TACCTGTCCG 
ATGGACAGGC 


6601 


CCTTTCTCCC 
GGAAAGAGGG 


TTCGGGAAGC 
AAGCCCTTCG 


GTGGCGCTTT 
CACCGCGAAA 


CTCATAGCTC 
GAGTATCGAG 


ACGCTGTAGG 
TGCGACATCC 



« t 

r i 



665 L 


TATCTCAGTT 
ATAGAGTCAA 


CGGTGTAGGT 
GCCACATCCA 


CGTTCGCTCC 
GCAAGCGAGG 


AAGCTGGGCT 
TTCGACCCGA 


GTGTGCACGA 
CACACGTGCT 


6701 


ACCCCCCGTT 
TGGGGGGCAA 


CAGCCCGACC 
GTCGGGCTGG 


GCTGCGCCTT 
CGACGCGGAA 


ATCCGGTAAC 
TAGGCCATTG 


TATCGTCTTG 
ATAGCAGAAC 


6751 


AGTCCAACCC 
TCAGGTTGGG 


GGTAAGACAC 
CCATTCTGTG 


GACTTATCGC 
CTGAATAGCG 


CACTGGCAGC 
GTGACCGTCG 


AGCCACTGGT 
TCGGTGACCA 


6801 


AACAGGATTA 
TTGTCCTAAT 


GCAGAGCGAG 
CGTCTCGCTC 


GTATGTAGGC 
CATAGATCCG 


GGTGCTACAG 
CCACGATGTC 


AGTTCTTGAA 
TCAAGAACTT 


6851 


GTGGTGGCCT 
CACCACCGGA 


AACTACGGCT 
TTGATGCCGA 


ACACTAGAAG 
TGTGATCTTC 


AACAGTATTT 
TTGTCATAAA 


GGTATCTGCG 
CCATAGACGC 


6901 


CTCTGCTGAA 
GAGACGACTT 


GCCAGTTACC 
CGGTCAATGG 


TTCGGAAAAA 
AAGCCTTTTT 


GAGTTGGTAG 
CTCAACCATC 


CTCTTGATCC 
GAGAACTAGG 


69y. 

'tut: 


GGCAAACAAA 
CCGTTTGTTT 


CCACCGCTGG 
GGTGGCGACC 


TAGCGGTGGT 
ATCGCCACCA 


• 

TTTTTTGTTT 
AAAAAACAAA 


GCAAGCAGCA 
CGTTCGTCGT 



7 081 GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA 
S CTAATGCGCG TCTTTTTTTC CTAGAGTTCT TCTAGGAAAC TAGAAAAGAT 



7 0fl CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC 
ni GCCCCAGACT GCGAGTCACC TTGCTTTTGA GTGCAATTCC CTAAAACCAG 



71g§ ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTGC GGCCGCAAAT 
r: TACTCTAATA GTTTTTCCTA GAAGTGGATC TAGGAAAACG CCGGCGTTTA 



71 £2 CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA 
GTTAGATTTC ATATATACTC ATTTGAACCA GACTGTCAAT GGTTACGAAT 



72W ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT 
TAGTCACTCC GTGGATAGAG TCGCTAGACA GATAAAGCAA GTAGGTATCA 



251 


TGCCTGACTC 
ACGGACTGAG 


CCCGTCGTGT 
GGGCAGCACA 


AGATAACTAC 
TCTATTGATG 


GATACGGGAG 
CTATGCCCTC 


GGCTTACCAT 
CCGAATGGTA 


301 


CTGGCCCCAG 
GACCGGGGTC 


TGCTGCAATG 
ACGACGTTAC 


ATACCGCGAG 
TATGGCGCTC 


ACCCACGCTC 
TGGGTGCGAG 


ACCGGCTCCA 
TGGCCGAGGT 


351 


GAT T TAT C AG 
CTAAATAGTC 


CAATAAACCA 
GTTATTTGGT 


GCCAGCCGGA 
CGGTCGGCCT 


AGGGCCGAGC 
TCCCGGCTCG 


GCAGAAGTGG 
CGTCTTCACC 


401 


TCCTGCAACT 
AGGACGTTGA 


TTATCCGCCT 
AATAGGCGGA 


CCATCCAGTC 
GGTAGGTCAG 


TATTAATTGT 
ATAATTAACA 


TGCCGGGAAG 
ACGGCCCTTC 


451 


CTAGAGTAAG 
GATCTCATTC 


TAGTTCGCCA 
ATCAAGCGGT 


GTTAATAGTT 
CAATTATCAA 


TGCGCAACGT 
ACGCGTTGCA 


TGTTGCCATT 
ACAACGGTAA 



7501 GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG 
CGATGTCCGT AGCACCACAG TGCGAGCAGC AAACCATACC GAAGTAAGTC 



7551 CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA 
GAGGCCAAGG GTTGCTAGTT CCGCTCAATG TACTAGGGGG TACAACACGT 




7 601 AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG 
TTTTTCGCCA AT.CGAGGAAG CCAGGAGGCT AGCAACAGTC TTCATTCAAC 



7 651 GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA ATTCTCTTAC 
CGGCGTCACA ATAGTGAGTA CCAATACCGT CGTGACGTAT TAAGAGAATG 



7 701 TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA 
ACAGTACGGT AGGCATTCTA CGAAAAGACA CTGACCACTC ATGAGTTGGT 



751 


AGTCATTCTG 
TCAGTAAGAC 


AGAATAGTGT 
TCTTATCACA 


ATGCGGCGAC 
TACGCCGCTG 


CGAGTTGCTC 
GCTCAACGAG 


TTGCCCGGCG 
AACGGGCCGC 


801 


TCAATACGGG 
AGTTATGCCC 


ATAATACCGC 
TATTATGGCG 


GCCACATAGC 
CGGTGTATCG 


AGAACTTTAA 
TCTTGAAATT 


AAGTGCTCAT 
TTCACGAGTA 


851 


CATTGGAAAA 
GTAACCTTTT 


CGTTCTTCGG 
GCAAGAAGCC 


GGCGAAAACT 
CCGCTTTTGA 


CTCAAGGATC 
GAGTTCCTAG 


TTACCGCTGT 
AATGGCGACA 


9tfl. 


TGAGATCCAG 
ACTCTAGGTC 


TTCGATGTAA 
AAGCTACATT 


CCCACTCGTG 
GGGTGAGCAC 


CACCCAACTG 
GTGGGTTGAC 


ATCTTCAGCA 
TAGAAGTCGT 



7 9¥l TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG GAAGGCAAAA 
tQ AGAAAATGAA AGTGGTCGCA AAGACCCACT CGTTTTTGTC CTTCCGTTTT 



8 0 CO TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA ATACTCATAC 
n\ ACGGCGTTTT TTCCCTTATT CCCGCTGTGC CTTTACAACT TATGAGTATG 



8 0 §4 TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG 

^ AGAAGGAAAA AGTTATAATA ACTTCGTAAA TAGTCCCAAT AACAGAGTAC 

^ »^ 

81 63 AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC 

W*> TCGCCTATGT ATAAACTTAC ATAAATCTTT TTATTTGTTT ATCCCCAAGG 

— hzr^ — — — — — —— — — — — — — — — — — — — — — — — — — — — — — — 

8lM GCGCACATTT C 

CGCGTGTAAA G 
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